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INTRODUCTION TO NEUTROSOPHIC STATISTICS 


Introduction 


Although the neutrosophic statistics has been 
defined since 1996, and published in the 1998 book 
Neutrosophy. / Neutrosophic Probability, Set, 
and Logic, it has not been developed since now. A 
similar fate had the neutrosophic probability that, 
except a few sporadic articles published in the 
meantime, it was barely developed in the 2013 book 


“Introduction to Neutrosophic Measure, 
Neutrosophic Integral, and Neutrosophic 
Probability”. 


Neutrosophic Statistics is an extension of the 
classical statistics,and one deals with set values 
instead of crisp values. 

In most of the classical statistics equations and 
formulas, one simply replaces several numbers by 
sets. And consequently, instead of operations with 
numbers, one uses operations with sets. One 
normally replaces the parameters that are 
indeterminate (imprecise, unsure, and _ even 
completely unknown). That’s why we made the 
convention that any number a that is replaced bya 
set be noted ay, meaning neutrosophic a, or 
imprecise, indeterminate a. ay can be a neighbour of 
a, can be an interval that includes a, and in general 
it can be any set that approximates a. In the worst 
scenario, @y can be unknown. In the best scenario 
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(when there is not indeterminacy related to a), an = 
a. 

Why this passage from crisp numbers to sets? 
Because in our real life we cannot always compute 
or provide exact values to the _ statistics 
characteristics, but we need to approximate them. 
This is one way to passing from classical to 
neutrosophic statistics, but other ways could be 
possible, depending on the types of 
indeterminacies, and the reader is kindly invited to 
do such research to be published in the next issues 
of the international journal of “Neutrosophic Sets 
and Systems”, http://fs.gallup.unm.edu/NSS/. 

The author would like to thank Prof. Yoshio 
Hada, the President of Okayama University of 
Science, Prof. Valery Kroumov from Okayama 
University of Science, Prof. Akira Inoue from the 
State University of Okayama, also Prof. Masahiro 
Inuiguchi, Dr.Masayo Tsurumi, and Dr. Yoshifumi 
Kusuroku from the University of Osaka, and Dr. 
Tomoe Entani from the Hyogo University for their 
valuable considerations and opinion during my 
postdoctoral research in Japan in December 2013 
and January 2014 about applications of the 
neutrosophic science in robotics and other fields. 

Any quantity computed with some indeterminacy 
from values in a sample (i.e. not exactly) is a 
neutrosophic statistics. 

A neutrosophic statistic is a random variable and 
as such has a neutrosophic probability distribution. 
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The long-run behaviour of a neutrosophic statistic’s 
values is described when one computes this 
statistic for many different samples, each of the 
same size. 

Neutrosophic Statistics is an extension of the 
classical statistics. While in classical statistics the 
data is known, formed by crisp numbers, in 
neutrosophic statistics the data has some 
indeterminacy. 

In the neutrosophic statistics, the data may be 
ambiguous, vague, imprecise, incomplete, even 
unknown. Instead of crisp numbers used in 
classical statistics, one uses sets (that respectively 
approximate these crisp numbers) in neutrosophic 
statistics. 

Also, in neutrosophic statistics the sample size 
may not be exactly known (for example the sample 
size could be between 90 and 100; this may happen 
because, for example, the statistician is not sure 
about 10 sample individuals if they belong or not to 
the population of interest; or because the 10 sample 
individuals only partially belong to the population 
of interest, while partially they don’t belong). 

In this example, the neutrosophic sample size is 
taken as an interval n = [90, 100], instead of a crisp 
number n = 90 (or n = 100) as in classical statistics. 

Another approach would be to only partially 
consider the data provided by the 10 sample 
individuals whose membership to the population of 
interest is only partial. 
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Neutrosophic Statistics 


Neutrosophic Statistics refers to a set of data, 
such that the data or a part of it are indeterminate 
in some degree, and to methods used to analyze the 
data. 
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In Classical Statistics all data are determined; 
this is the distinction between neutrosophic 
statistics and classical statistics. 

In many cases, when indeterminacy is zero, 
neutrosophic statistics coincides with classical 
statistics. 

We can use the neutrosophic measure for 
measuring the indeterminate data. 

The neutrosophic statistical methods will enable 
us to interpret and organize the neutrosophic data 
(data that may have some indeterminacies) in order 
to reveal underlying patterns. 

There are many approaches that can be used in 
neutrosophic statistics. We present several of them 
through examples, and afterwards generalizations 
for classes of examples. Yet, the reader can invent 
new approaches as well in_ studying’ the 
neutrosophic statistics. 

We emphasize, as in neutrosophic probability, 
that indeterminacy is different from randomness. 
While classical statistics is referring to randomness 
only, neutrosophic statistics is referring to both 
randomness and especially indeterminacy. 

Neutrosophic Descriptive Statistics is 
comprised of all techniques to summarize and 
describe the mneutrosophic numerical data’s 
characteristics. 

Since neutrosophic numerical data contain 
indeterminacies, the neutrosofic line graphs, and 
neutrosophic histograms are represented in 3D- 
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spaces, instead of 2D-spaces as in classical 
statistics. The third dimension, in addition of the 
XOY Cartesian System, is that of indeterminacy (I). 
From unclear graphical data displays we can 
extract neutrosophic (unclear) information. 

Neustrosophic Inferential Statistics consists of 
methods that permit the generalization from a 
neutrosophic sampling to a population from which 
it was selected the sample. 

Neutrosophic Data is the data that contains 
some indeterminacy. 

Similarly to the classical statistics it can be 
classified as: 

- discrete neutrosophic data, if the values 
are isolated points;for example: 6+ i,, where i, € 
[0,1],7, 26+i,, where i, € [3,5]; 

- and continuous neutrosophic data, if the 
values form one or more intervals, for example: 
[0,0.8] or [0.1, 1.0] (i.e. not sure which one). 

Another classification: 

- quantitative (numerical) neutrosophic 
data; for example: a number in the interval [2, 5] 
(we do not know exactly), 47, 52, 67 or 69 (we do 
not know exactly); 

- and qualitative (categorical) neutrosophic 
data; for example: blue or red (we don’t know 
exactly), white, black or green or yellow (not 
knowing exactly). 

Also, we may have: 
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- univariate neutrosophic data, i.e. neutro- 
sophic data that consists of observations on a 
neutrosophic single attribute; 

- and multivariable neutrosophic data, i.e. 
neutrosophic data that consists of observations on 
two or more attributes. 

As a particular cases we mention the bivariate 
neutrosophic data, and trivariate neutrosophic 
data. 

A Neutrosopical Statistical Number N has the 
form: 

N=d+i, 
where d is the determinate (sure) part of N, and tis 
the indeterminate (unsure) part of N. 

For example, a = 5 + i, where 
i € [0, 0.4], is equivalent toa € [5,5.4], so for surea=>5 
(meaning that the determinate part of ais 5), while 
the indeterminate part i¢€[0,0.4] means the 
possibility for number ,a” to be a little bigger than 
oy 

We may consider, similarly to the classical 
statistics, aneutrosophic stem-and-leaf display of 
data. 

For example, let’s have the neutrosophic data 
that follows: 

6 + i,, withi, € (0, 0.2); 
7 + iywithi, € [2, 3]; 
6 + iz, withiz € [0, 1]; 
9+ i,,withi, € [1.1, 1.5); 
9+ iy. 
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Its neutrosophic stem-and-leaf display is: 


6]ft1 ts 
7\|_ le 
Olli, U4 
or under the form of interval: 
6) (0,0.2) [0,1] 
7 [2, 3] 
911(0,0.2) [1.1,1.5] 


Obviously a neutrosophic statistic number can 
be written in many ways. 

If you retake: a = 5 +i, withi € [0,0.4], then 
a=4+i,, with i € [1,1.4], ora = 3 + iz, with 
iz € [2,2.4], and in generala =« + ix, with 
ix € [5—«,5.4—«],and « any real number. 

Or in opposite way: 

a = 5.4 —i3, with iz € [0, 0.4], 
and in general 

a= 6 —ig, with ig € [6 —5.4,6 —5],andf any real 
number. 

A Neutrosophic Frequency Distribution is a 
table displaying the categories, frequencies, and 
relative frequencies with some indeterminacies. 
Most often, indeterminacies occur due to imprecise, 
incomplete or unknown data related to frequency. 
As a consequence, relative frequency becomes 
imprecise, incomplete, or unknown too. 

An example about the neutrosophic frequency 
distribution concerning the number of accidents by 
car drivers. 
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Number of suiidere New userhi: frequency Nevirvcouptk ceative frequercy 
i) Y (0185,0227 
i (a0, 80) (0240, 0337 
é (70, 90} (O250, 0375) 
j (40, $c [01540217 
“nel Oo >| , Pye Pi wt f OLa5 9, a 15 4 ] 


How to read the previous table, let’s say line #2: 
the number of car drivers with only one accident is 
between 60 and 80 (thus unclear information), and 
corresponding mneutrosophic relative frequency 
between 0.2240 and 0.333. 

To compute the total for the neutrosophic 
frequencies, where we have imprecise information, 
we compute the min and max of estimated 
frequencies: 

Minne = 50+ 60+ 70 + 40 = 220, 
andmax,r = 50+ 80+ 90+ 50 = 270. 

To compute the neutrosophic relative frequency, 
we also do the min and max of all possibilities. 

For zero accidents: 


. 50 
MINnr f = 270 ~ 0.185 


50 
andmaxn;f = 20 a 0.227, 
or 50 + [220,270] = [0.185, 0.227]. 
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For one accident one has: 


60 
fi ieee ae AG 
™™nrt =~ SO 460 +90 +50 
80 
MAX ppg = ————_-———- © 0.333. 


50+ 80+ 70+ 40 
For two accidents one has: 


70 
Pe ee ee 1) 
™nrt =~ SO 4 80 +70 450 


90 
d Sia 375, 
ru Ty, 5 BQ GQ OOP AG 


The interval [0.280, 0.375] is different from: 
[70, 90] + [220,270] = Foe ~ [0,259, 0.409]. 
For three accidents one has: 


40 
eee SC 
™Tnrf = 59480 +90 + 40 


50 
d STF 
cc ecaacia 5 Maat 1, Ter, Ber, TG 


and similarly the interval [0.154, 0.217] is 
different from: 
40 50 
270’ 220 

We simply cumulated the neutrosophic relative 
frequencies as an addition of intervals: 

(0.185, 0.227] + [0.240, 0.333] + [0.280, 0.375] 
+ [0.154, 0.217] = [0.859, 1.152]. 


[40, 50] + [220,270] = | ~ [0.148, 0.227]. 


Neutrosophic Statistical Graphsare graphs 
that have indeterminate (unclear, vague, 
ambiguous, unknown) data or curves. 

1.a. Neutrosophic Bar Graph: 
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Table: Time spent by an American daily 
T=watching TV: between [4,5] hours; 
B=reading books: between [1,2] hours; 
D=driving: between [1,3] hours; 
S=sleeping: between [6,9] hours. 


2.a. Neutrosophic Circle Graph for the same 
example: 
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3.a. Neutrosophic Double Line Graph for the 
same example: 


| Fon Noms 9 


NU V 
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4.a. Neutrosophic Line Plot for the same 
example: 


x = one hour 


= one possible hour 
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5.a. Neutrosophic Pictograph for the same 
example: 


| | | | | [ | | 
aaa eee 


Green color rectangle: one hour 
Red color rectangle: one possible hour 


6.a. Neutrosophic 2D Histogram is a neutro- 
sophic bar graph such that the bars are vertical, 
there is no gap between bars (the bars of height zero 
are also included), and the width of each bar has 
the size of the represented interval. It shows, within 
a certain interval, the approximate number of times 
data occur. 
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Neutrosophic distribution of 
family income frequencies 


i 
wo 


eRe 
a 


wo 


Frequency of families (in millions) 
~—I 


10-29 30-49 50 - 69 70-89 90-109 110 - 129 


Income (in thousand $) per year 


where ~— indicates in the numbering scale a 
distortion. 

The frequencies are not crisp numbers as in 
classical statistics, but between some limits. For 
example, the number of families with income in 
between $10,000 — $29,000 is between 7 and 9 
millions of families. Similarly for other classes of 
income, except for the last class of income in 
between $110,000 — $129,000 that corresponds to 
a crisp number: 1 million of families. 


19 
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We represented all types of mneutrosophic 
statistical graphs in a space of dimension two (2D) 
as in classical statistics, but it is also possible to 
make the graphs in a space o dimension three (3D), 
just adding to each of the previous 2D-graphs an 
indeterminate dimension, which measures the 
indeterminacy of the data. 


1.b. The Neutrosophic 3D Bar Graph 


The deepness axis (i) measures the indeterminacy. 
For the previous example: Time spent by an American daily. 
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2.b. Neutrosophic Cylinder Graph 


The heights (that represent indeterminacies) of T 
and B are the same, while the height of D is double, 
and the height of S is triple. 


3.b. TheNeutrosophic 3D-Line Graph 


2| 
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for the same example. We plot the points of 
coordinates (T, 4, 1), (B, 1, 1), (D, 1, 2), and (S, 6, 
3), where the second component represents the 
determinate part (y) and the third component the 
maximum indeterminacy (j, and connect them. We 
get a 3D curve. 


4.b. Neutrosophic 3D Plot for the same 
example: 
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5.b. Neutrosophic 3D Pictograph for the same 
example: 


6.b. Neutrosophic 3D Histogram for the same 
example of Neutrosophic distribution of family 
income frequencies: 


Neutrosophic distribution of family income frequencies 


Frequency of families (in millions) 


10-29 30-49 50-69 70-89 90-109 110-129 
Income (in thousand $) per year 
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Statistical Deceptions can be expressed in the 
neutrosophic way. For example: 


a) 


b) 


*Company’s heating bill went up to 10% 
last year.” In a neutrosophic way we can 
write: [0, 10]% (which could be any 
number between O and 10, including the 
extremes). 

“We guarantee you lose as much as 15 
pounds in a month, or your money back.” 
Actually you lose [0, 15] pounds, so you 
may lose no pound! 

“No product is better than Brian’s.” This 
means that other products could be the 
same as Brian’s! 


24 
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Neutrosophic Quartiles 


Let’s consider the set of neutrosophic 
observations of a variable listed in almost ascending 
order (since we deal with sets instead of crisp 
numbers we have a partial order). 

The neutrosophic quartiles are similarly as in 
classical statistics defined: the first (lower quartile) 


is the =(n +1)th, the second is the =(n +1)th, and 


the third the =(n +1)th. 


If (n + 1) is not divisible by 4, then one takes the 
average of the two neutrosophic observations whose 
ranks the quartile falls in between. Another 
procedure is to take the inferior integer part of 


= (n+ 1), for i = 1,2,3. 


Let’s compute the midpoint of a set U in the 


following way: 
infU +sup U 
2 
We can define a total order on the n neutrosophic 


observation sets in the following way: 
for any sets U and V we have U<v if 
either midpoint (LI) < midpoint(v), 
loaudeea (LU) = midpoint(V)and min U< minv. 
If it happens that 
midpoint (LU!) = midpoint(v) 
and min U= min V, 
then automatically max U = maxvV, therefore 
U=V. 


midpoint U= 
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An example with n = 12 ascending neutrosophic 
observations: 


1, (2,3),[{4, 6}, 5} [7,10], |[7, 11], 9} 12,[14, [14, 15)} 20, 


(24) W222; 25). 
First quartile: 


1 1 
gmt 1) =7(12 + 1) = 3.25, 


then we average the 3™ and the 4th ranked 
observations: 


(4,6}+5 {445,645} (9,11) _ 


——_—__—_— ee = {4.5,5.5} 
2 2 2 = sa ee tye 
Second quartile: 


2 Z 
— 1) =—(12 + 1) = 6.50, 
z(n+1)=F(12+1) 


then we average the 6t and 7th ranked 
observations: 


pe [Ee AS) = (8,10) 
2 as I a ca 
Third quartile: 


3 3 
— 1) =—-—(12 + 1) = 9.75, 
F(n+1) =F (12+1) 
then we average the 


Oth and 10th ranked 
observations: 
144+[14,15] 14414 14415 


y) = 


a in [14, 14.5). 
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Neutrosophic Sample 


A Neutrosophic Sample is a chosen subset of a 
population, subset that contains some 
indeterminacy: either with respect to several of its 
individuals (that might not belong to the population 
we study, or they might only partially belong to it), 
or with respect to the subset as a whole. While the 
classical samples provide accurate information, the 
neutrosophic samples provide vague or incomplete 
information. 

By language abuse one can say that any sample 
is a neutrosophic sample, since one may consider 
their determinacy equals to zero. 

Neutrosophic Survey Results are survey results 
that contain some indeterminacy. 

A Neutrosophic Population is a population not 
well determined at the level of membership (i.e. not 
sure if some individuals belong or do not belong to 
the population). 

For example, as in the neutrosophic set, a 
generic element x belongs to the neutrosophic 
population M in the following way, x(tif)EM, 
which means: x is t % in the population M, f %x is 
not in the population M, while 1 % the appurtenance 
of x to M is indeterminate (unknown, unclear, 
neutral: neither in the population nor outside). 

Example. Let’s consider the population of a 
country Ci. Most people in this country have only 
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the citizenship of the country, therefore they belong 
100% to C;. But there are people that have double 
citizenships, of countries C; and C2. Those people 
belong 50% to Ci, and 50% to C2. While citizens 
with triple citizenships of countries Ci, C2, and C3 
belong only 33.33% to each country. Of course, 
considering various criteria these percentages may 
differ. Also, there are countries with autonomous 
zones, whose citizens in these zones may not 
entirely consider themselves as belonging to those 
countries. 

But there is another category of people that have 
been stripped from their C; citizenship for political 
reasons and they have other citizenship, while still 
living (temporarily) in C;.They are called paria, and 
they do not belong to C; (not having citizenship), but 
still belong to C; (because they still living in C). 
They form the indeterminate part of neutrosophic 
population of country C1. 

A simple random neutrosophic sample of size 
n from a classical or neutrosophic population is a 
sample of n individuals such that at least one of 
them has some indeterminacy. 

Example. One considers a random sample of 
1,000 homes, in a city of over one million 
inhabitants, in order to investigate how many 
houses have at least a laptop. One finds out that 
600 houses have at least one laptop, 300 houses 
don’t have any laptop, while 100 houses have each 
of them a single laptop, but not working. 
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Some of these 100 house owners tried to have 
their laptop fixed, others said their laptops’ hard 
drives have crashed and it is little chance to fix 
them. Therefore indeterminacy. We have a simple 
random neutrosophic sample of size 100. 

Similarly as in classical statistics, in a stratified 
random neutrosophic sampling the pollster 
groups the (classical or neutrosophic) population by 
a strata according to a classification; afterwards the 
pollster takes a random sample (of appropriate size 
according to a criterion) from each group. If there is 
some indeterminacy, we deal with neutrosophic 
sampling. 

Example. One considers two strata: men and 
women in the city of Gallup, New Mexico. But, since 
women represent 51% of the population and men 
49%, one takes a random sample of 51 women and 
a random sample of 49 men. 

But later learn that ,one” man and two “women” 
are actually transgender. Therefore 3 individuals 
are indeterminate. Whence one has stratified 
random neutrosophic sampling. 

If the (classical or neutrosophic) population is 
divided into subgroups, such that each subgroup is 
representative of the population, and then one 
collects from these subgroups a random sample 
and there is some indeterminacy, then one has a 
neutrosophic cluster sampling. 

Example. Suppose 5 professors conduct PhD 
dissertations in mneutrosophic statistics. Each 
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professor has a number of graduate students, but 
some students are undecided whether to pursue 
their dissertations in classical or neutrosophic 
statistics. The professors represent the clusters. 
One randomly selects 2 professors to interview their 
students about research in neutrosophic statistics. 
But, because some students are undecided 
(indeterminate) with respect to their research topic, 
we have a neutrosophic cluster sampling. 

A convenience sample is likely to be inaccurate 
since the pollster selects a sample of individuals 
that are readily available, who might answer 
randomly to the questions in order to finish faster. 
The less the individuals are interested in the survey 
results, the more likely inaccurate are the survey 
results. While a voluntary-response sample is 
more likely to be biased, since the sample 
individuals may volunteer in purpose to influence 
the survey results. 

Besides these two categories of sample 
individuals there is another one of malicious people 
that might oppositely answer to the questions in 
order to produce false results. 

That’s why data of some sample individuals has 
to be removed, but often we don’t know which ones. 
Therefore, we have indeterminacy related to the 
sample size: how many sample people were from the 
above three categories, and how to depict their data 
in order to remove them from the survey results? 
Again, neutrosophic statistics. 
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Neutrosophic Numerical 
Measures 


Example with Neutrosophic Numbers a+bDI , 
where a, bare real numbers, and Jis indeterminacy, 
such that /*? =] and 0:1 =0. 


Let’s have the neutrosophic numbers: 


=2=47, 14-013 4556471 
Compute their mean: 


3| 
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(-2-41) +(-14+0-DN+@G+5D+(6+7) _ 
SB 


2 eee ee Sate Oa os 
ne rn 
Compute their median: 
(-1+0-1)+(8+5/J) -1+3 0+5 
EE +——J=14+2.5]. 
2 2 2 
Compute the deviation of each neutrosophic 
number with respect to the mean: 
(—2 —4/) — (1.5 + 2/) = —3.5 — 61, 
(-1+0-N-(1.542/) = -2.5—-2I, 
(3 +51) -(1.5+2/) =15+3I, 
(64+ 71) —(1.5+2/) =4.5451. 
Square the deviations: 
(—3.5 — 61)? = (—3.5)* + 2(—3.5)(—6)I + (—6)7/? 
= 12.25 + 42] + 36/7 = 12.25 + 42] + 36] 
= 12.25+ 78! 
(—2.5 — 21)? = 6.254 141 
(1.5 + 31)* = 2.25+ 18] 
(4.5 + 51)? = 20.25 + 701. 
We are following the formula: 
(a+ bl)? = a* + 2abl + b2I? 
= a* + 2abI + b?] 


*2=1.542I. 


or 
(a+ bl)* = a* + (2ab+b7)I. 


Compute the standard deviation: 
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Sz 


= [2.25 + 781) + (6.25 + 14/) + (2.25 + 187) + (20.25 + 701) 
7 4 


= V10.25 + 45]. 

To compute the square root of a neutrosophic 
number we denote the result as x+yl and 
determine x and y: 

V10.25 +45! =x+ yl. 

Raise both sides to the second power: 

10.25 + 45] = x* + (2xy + y7)I. 
Therefore: 


10.25 = x? 

45 =2xy+y" 

Since standard deviation is positive, we take 
x = +Vv10.25 = 3.20 

and replace it into the second equation: 
45 = 2(3.20)y + y? 

and solve for positive y: 
y?+64y—45 =0 

whence 


_ 6.4 + [6.47 — 4(1)(—45) _ 
a gg Te 0.64. 


Therefore, the neutrosophic standard deviation 
of the previous four neutrosophic numbers is 
3.20 + 0.64 I. 
We observe that 3.20 is the classical standard 
deviation of the determinate parts of the previous 
neutrosophic numbers: —2,-—1,3,6; but 0.64 is not 
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the classical standard deviation oof the 
indeterminate parts of the previous neutrosophic 
numbers: —4,0,5, 7. 

The classical standard deviation of the numbers 
-4, 0, 5, 7, whose mean is 2, is: 


4 —2)2+(0- iat (-2?+7-27 4) 


Indeterminacy has propagated when squaring 
the deviations. 


Classical Neutrosophic 
Numbers 


A classical Neutrosophic Number has _ the 
standard form: 
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a+ bl, 
where a, b are real or complex coefficients, and I 
= indeterminacy, such 0-/ = 0 and /* = IJ. 
It results that /” =/ for all positive integer n. 
If the coefficients a and b are real, then a+ bI is 
called Neutrosophic Real Number. 


Examples: 2+ 3/,-—5+ I, etc. 


But if the coefficients a and b be are complex, 
then a+blI is called Neutrosophic Complex 
Number. 

Examples: (5+ 2i)+(2-8i)/,J+i+9I-il, etc. 
where i = ¥—1. 

A neutrosophic complex number can be better 
written as: 

a+ bi+clI + dil, where a, b, c, and dare reals. 

Of course, any real number can be considered, 
by language abuse, a neutrosophic number. 

For example: 

5=5+0-], 

or 

5=5+0-i+0-'1+0-i°1. 

We call it a degenerated neutrosophic number. 

A true neutrosophic number contains the 
indeterminacy J with a non-zero coefficient. 


Division of classical neutrosophic real numbers. 


(a, + b,!) _ (a, + b51) =7 
We denote the result by: 
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a, + byl 
Az + bz! 
then multiply and identify the coefficients: 
a, + byl = (x + yl)(az + bol) 
= xa, + xboI + yagl + ybyI? 
= (aox) + (box + any + boy)l. 
Whence we form an algebraic system of 
equations, by identifying the coefficients: 


=x-+yi, 


AxX = ay 
box + azy + boy = by 
or 
A2X = Ay 
box + (a2 + b2)y = by’ 
One obtains unique solution only when the 
a, 0 
by az,+by 
or az(a, + bz) # 0. Hence a, #0 and a, # —bz are 
that conditions for the division of neutrosophic real 
numbers 


determinant of second order +0 


Ay + b,1 
az + bol 
to exist. 
Then 
ay 
y= — 
a2 
and 
arb, — a,b 
a ar TES 
az (az + bz) 
or 
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ay + b,I _ ay ab, aa a,b 
A, +b2l a, a,(ag+b2) © 
As consequences, we have: 


dailies epee ati pea for k a non-zero real 
ak+bkI k(a+bl) k’ 


number, and fora #0anda# -—b. 


I a 1 
i ae fora #O0anda##-—b. 


3. Divisions by I, -I, and in general by k/, for k 
a real, are undefined. 


a+blI . 
me undefined, for any real k, and any realsa 


and b. 
In particular: 


I 
7 = undefined; 


— = undefined; 


= undefined; 


a+ bl 


= undefined. 


bl b 
4. =< ==+--Lforc # 0; 


5. ey ee fora #0anda#-—b. 
a+blI a a(atb) 


6. =<, for b #0 (the classical division of 


7 a= {42-7 =a +b. 
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0 0 . a-0-0-b 
ee aay for a# 
0and a + —b. 


kI _ ok 
at+tbI atb 


I, for any real kk, anda#0Oanda# 
—b. 
Let’s fo a concrete example by calculation. 
What is (2+ 3/)+(1+4+/) =? 
Denote: 
231 


1+] 


=x-tyl. 
One has: 
14+Daety)=xt+yltxl+ yl? =2+3I1 
x+(x4+2y)l =24 31. 


A os 

Whence , £Iy=3 
or x =2,y = 0.5. 
There 

Aa) ae 

a 2+ 0.51. 
Let’s check: 

24+ 31 

24051 °°! 
Then 


(2+ 0.5I)(x+ yl) = 243, 
2x + (2y + 0,5x + O0.Sy) = 2 + 3. 


Whence 
2hi=2 
0.5x + 2.5y = 3’ 
hence 
ool y=1, 
or 
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fea =1+1-:J/=1+4+/ 
2+0.5] — 7 
Perfect. 
Another example. 
2+ 3! telah 
res) ae 
Whence 
By = 2 
12x + 12y+ 8y =3 
and we get 
Ze. 
x=-=- 
8 4 
and 
1 
12 (=) +20y=3, ory=0. 
Therefore 
it Ae 
8+121 4 A 


which is a neutrosophic simplification since: 
2+3) 1°@+3/) 1 
8412] 4:(24+31D) 4 


Now an example which is undefined: 


Die 3] 
=e 
1-1 
a ae 
hag 


(1-D(xtyl =2431 

x+yl —xI-—yl* =24+3I 
or 

x+(y-x-y)l=24+31 
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or 
x—-xl=2+4+3iI, 
therefore 
x=2 and -—-x=3, 
which is impossible. 


Therefore 
2431 
1-1 
And an example where it results infinitely 
solutions: 


= undefined. 


I 
| 
~ 


Denote 
I 
7> xe yl, 
SO 
Ia + yl) =I, 
or 
xI+yl* =], 
or 
(x+y)l=1-], 


whence x +y=1, where x and y are unknown 
reals. 

We get infinitely many solutions: 

xE€Rand y=1-~x, 

where R& is the set of real numbers. Among 
solutions there are: 

1, I, 2-J, etc. 

But since the division’s result should be unique, 
we say that 
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I 
7 = undefined. 


Root index n > 2 of a neutrosophic real number. 


First let’s compute the square root: 

Va + bl, where a, bare reals. 

Let’s denote: 

Vat+bl=x+yl, 

where x and y are real unknowns, and raise both 
sides to the second power. One gets: 
at+bl =(xt+ yl)? = x? 4+ 2xyl + y7I? = x* 4+ 2xyl + y7! 

=2° Fry tT. 


x*=a 
Whence Se ee 
Hence , x= iva 
y*7+2Va-y—b=0 


and we solve the second equation for y: 
F2VatV4a+4b F2Vat+2Vatb 
—~—— 905. °° ~~ eS 
= FVatVa+b, 
and the four solution are: 
(x,y) = (Va, -Va + Va +b), (Va, -Va - Vat), 
(-Va, Va + Va+b), or(—va, Va 7 Va +b). 
Thus: 
Va+ bl =Va+(-Va+va+ b)I, 
or 
Va — (va +va + b)I, 


or 
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—~Ja+ (Va +Va¥5)), 


or 


—Ja+(Va—-vVa+b)I. 


Let’s consider an example done through all 
calculations: 


V9+7I =? 
Let’s denote: 
V94+71 =x+4+yl. 


Then: 
94+ 7] =x? 4+ 2xyl + y7I? = x* + (2xy + y?)I. 

Whence 

x* =9 orx = +3 

| 2xy+y*=7 
Let’s find y: 
x=3 x=-3 
6y+y72= —6+y%= 
y7+6y-7=0 y*-6y-7=0 
VED = 1S 0 se Ota 
VAT te ea yal 
(3,-7),(3,1) (-3,7),(-3,-1). 
Therefore, V9 + 7] = +3 +1 (four solutions). 
As a particular case we can compute vi. 
Let’s consider VI = x + yl, then 
O+1-1=x*4+(2xyt+y’):I 

and we need to find x and y. 
Whence x* = 0,orx = 0, 
and 2xy+ y* = lory* = 1l,ory = +1. 
Hence VI = +1. 
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Similarly for ‘VI. 

Let’s consider ‘VJ = x + yl, 

or O+1-12 =x" + (SRG hy” *x®) «1, 
wherex” = 0, or x = 0, 


and 
n-1 


> CKyn-kyk = 4, 
k=0 
or y" = 1, whence y = V1 and we get n solutions: 
a real solution y = 1 and n— 1 complex solutions in 
the case we are interested in neutrosophic complex 
solutions as roots index n of 1. 
In the same way, we can compute root index n = 
2 of any neutrosophic number: 
Va—bl=x+yl 
or 
a+ bl =(x+ yl)” 
n-1 
ae ee ( + nee) -[= 
k=0 


n-1 
= xr + (> chy ee 


k=0 

where Ck means combination of n elements taken 
by groups of k elements. 

Whence x = Va if n is odd, or x=+'%Va if n is 
even, 

and 


n-1 
(Sciyend) =, 
k=0 
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and solve it for y. 

When the x and y solutions are real, we get 
neutrosophic real solutions, and when x and y 
solutions are complex, we get neutrosophic complex 
solutions. 

Let a+ bi+cl+dil be a neutrosophic complex 
number, where a, Db, c, d are reals. Let’s compute 
square root of it: 


(Vat+bi+cl + dil) = (x+ yi+ zl + wil)? 
a+bi+cl + dil 
= x* —y?24 2717 + w7i7l? + 2xyi + 2xzI 
+ 2xwil + 2yzil + 2ywi?l + 2zwil? 
=x? -—y24+ 27] —w?I 4 2xyi + 2x2! 
+ 2xwil + 2yzil — 2ywl + 2zwil 
= (x* — y*) + 2xyi 
+ (22 —w? + 2xz — 2yw)I 
+ (2xw + 2yz + 2zw)il. 
Then we get a non-linear algebraic system in four 
variables (x, y, 2, w) and four equations: 
yay =a 
2xy =b 
z* —w* + 2xz— 2yw =c 
2xw + 2yz + 2zw = d. 
In a more general way, we can compute root 
indexn of a neutrosophic complex number: 


ue 
(a+ bi+cl+diln=x+yit+zl+wil, 
where x, y, Z, Ww are variables in the set of real 


numbers. 
Raising to the power n in both sides, one gets: 
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a+bi+cl+dil = («+ yi+zlI + wil)” 
=fioy) t+ hay)it faay,w,z)l 
+ fax y,w, z)il, 
where f;, fo, fz, fy are real functions. 
Whence we get a non-linear algebraic system in 
four variables (x, y, w, z) and four equations: 
fity) =a 
fr(%y) = b 
f3(%y,w,z) =c 
fa(x%y,w,Z) = d, 
that we need to solve. 
Similarly, one can compute square root of a 
complex number. 
Let a + bi, where i = V—1, and a, b are reals, be a 
complex number. 
Va+bi=x+yi such that (x + yi)? =a + bi, 
where x and y are real numbers; 
or x7 + 2xyit+t y*i* =a+ bi, 
or (x* —y*) + (2xy)i =a + bi, 
whence oe , ° 
From the first equation x = +,/y* + ais replaced 
into the second equation: 
+2y.Jy? +a = b. (RE) 
Raising both sides to the second power one gets: 
4y?(y* +a) = b’, 
or 
Ay* + 4ay* — b? = 0. 
Let z = y*. Then 4z? + 4az — b* = 0, then 
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—4a+./16a2—4:-4(—b?) —-4a+v16a? + 16b? 
ZO —_ = rrmRaoau_ ——————— 


2(4) 8 
_—-4at4Va2 +b? -a+va2 +b? 
7 8 = 2 
Then 
—a+va? + b2 
y=e | 
Z 
and 
b b +b 
MEO Ee Oona EE 
AY [ratvare? 2 V2at2Va? +b? 
“< 2 2 
for y # 0. 


Since (RE) is a radical equation, we need to check 
each solution of unknown y to make sure the 
solution is not extraneous. 

a al q2 2 

Becausev a? + b? > +a, the expression oe > 
0, therefore there are at least two real values for y, 
—a+ va? + b? 

2 J 

while -a — Va? + b* < 0 and one has equality only 

when b = 0, resulting in y = 0. 


y=r 


As a particular case, Vi = 8 + 2. i, or - a2 ee 


since we write: 

i=0+1-:i, whence a=0O, b= 1, 

and we replace both of them into the x and y of 
previous formulas. 

We can check the results: 
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2 
v2 v2 2, 2 ,22_1,, 1 
FG Tae ger gee ag 
v2 By ai 
—=-=i) =i. 
2 2 


Let’s have another example, doing all 
calculations: 


and similarly ( 


Vv3-—4i=? 
Denote V3-4i=x+ yi. 


Then 
3-41 = (x + yi)? = (x* —y*) 4+ (2xy)i. 
Whence 
a =y°=3 
2xy = —4. 
Solve this system. 
From the second equation, y = =, and replace y 


into the first: 


or 

x*-—--3=0, 
or 

x* —3x2-4=0, 
or 

(x? —4)(x2 +1) =0, 
whence 
x?-4=0, 
or 
x= +2. 


47 


Florentin Smarandache 


Then 
= Se = =e =i 
aa +2 
Solutions: 
V¥3—4i=+4(2-i). 


Checking the result: 
[+(2 -1)]* =4-4i1 +i? =3 - 41. 
Remarkably, wel get the same solutions if we 
take the complex values of x and y, because: 
x? +1 =O gives x = +V-1 = +i, 


and replacing them into the substitution y = a 


we get: 
2 SL A OS Zo ZI 
Yea a, Gage ee 
Then 
V¥3-—4¢=x+yi = tit 2i-i=+tit2(-1) =42Hi 
= +(2 — i). 


One generalizes this procedure and one 
computes root index n of any complex number: 


Vat bi =? 


Similarly denote: 
Vat bi=x+ yi, 
then 
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at bi=(x+ yi)" = (yitx)” 
a 
= > (2k y 2k j2k yn-2k 
nN 
k=0 
ieee 
2 
a > C2k+1y2k+1 p2k+1 yn—2k-1, 
n 
k=0 
and one obtains a non-linear algebraic system of 
degree n, in two variables x and y, and two 
equations: 
I 


»: (2k y2k (4) yn-2k = g 
nN 
k=0 
n-1 
ew 
bs (2k+1y2k+1(_4)kyn-2k-1 — p 
k=0 
that one solves with a computer program. 
As a particular case, let’s compute the cubic root 
of a complex number: 


Va+bi=? 
Then: 
Vat+bi=x+ yi, 
or 


at+bi=(x+ yi)? =x3 + 3x*yi + 3xy7i? + yri3 
= (x3 — 3xy*) + (3x2y — y3)4, 
whence 
x3 -—3xy*=a 
ees = =p 
and solve for x and y. 
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From the first equation: 


yee x3-a 
eis 3x 


Replace this substitution into the second 
equation: 


and solve this independent equation for x with a 
calculator, and then find y from the above 
substitution. 

For example: 


Vi = -i. 
Neutrosophic Real or Complex Polynomial. 


A polynomial whose coefficients (at least one of 
them containing J) are neutrosophic numbers is 
called Neutrosophic Polynomials. 

Similarly we may have Neutrosophic Real 
Polynomials if its coefficients are neutrosophic real 
numbers, and Neutrosophic Complex 
Polynomials if its coefficients are neutrosophic 
complex numbers. 

Examples: 

P(x) =x*+(2-Dx-543I] 
is a neutrosophic real polynomial, while 
Q(x) = 3x2 4+ (14+ 6i)x? + 5Ix — 4il 
is a neutrosophic complex polynomial. 
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From these polynomials we proceed to solving 
Neutrosophic Real or Complex Polynomial 
Equations. 

Let’s consider the following neutrosophic real 
polynomial equation: 

6x? + (10 —I)x + 31 =0, 
and solve it just using the quadratic fromula: 
_ -@0-N + J/G0- 1)? - 46) 3) 
. 2(6) 
—10+/+v100 — 20/ + [2 —72I1 
12 
—10+/+4-¥100 — 201+ 1-—72I 
12 
—10 +1 +¥100 — 917 
7 12 

Now, we need to compute V100 — 911. 

Let’s denote: V100 — 91] = a + BI, 

where a, £ are reals. 

Raise both sides to the second power: 

100 — 91] = a? + 2aB1 + B21? = a? + 2a Bl + B21 
= a* + (2aB + B*)I, 
whence 
a* = 100 
2ap + B2 = —-91., 

Hence a = +V100 = +10. 

1. If a=10, then 2(10)8+ B82 =—-91, or B*+ 
206 + 91 = 0. Using the quadratic formula, one gets: 
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= —20 +202 -—4(1)91 _ —-20 + V400 — 364 


2 2 
—20 + 6 
—-2046 0 QF 
~~ 2. 20-6 
—— = -13. 
2 


2. If@=-10, then £7 — 208 + 91=0, 
g = 20+ V¥C20)? — 491 _ 204 V400 — 364 


2 
20+6 
20+6 i 
= =( 906 
= 7, 
2 


The four solutions are: 

(a, B) = (10, —7), (10, —13), (—10, 13), (—10, 7). 
We go back now and find x: 
10+ s100= 9:47 
ee — a = 

Therefore, we previously found out that 
¥100 —- 917 =10-—7/, or -—10+7/, or 10-—13/, or 
—10 +131. 

Since one has + in front of the radical, 10 — 7i 
and —10+7/ get the same values for x. Similarly, 
10 — 137 and —10+ 13/. 

—10+/+ (10-71) 


x 


*1,2 = 12 

MO etsy .<6r.. a 
Sha. en 
MOLTa40407. =20er “5. 0 
ae FE RR i 27 
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~10 +7 + (10-137) 
X42 = 


12 
—10+/+10—137 12) | : 
= 12 ~ 2 
—10+1/—10+137 -—20+141 _ 107, 
12 7 12 — 6 6° 


We got four neutrosophic solutions 
{[-5h-34+5L-L-2 +21} for 
2 3 3 6 6 
a neutrosophic real polynomial of degree 2. 
First neutrosophic factoring: 
P(x) = 6x2 + (10 —Dx +31 =6 E = (-31)| . E = 
5 2 1 10 7 
(-F +5) = (x +31) (e+ F-H). 
Second neutrosophic factoring: 


P(x) = 6x2 + (10 —J)x + 31 
= 6[x —(-I]- E — (-=+ =!) 


eG 
= 6( +51)( sce =!) 
SN Cy ee eo) 


Differently from the classical polynomial with 
real or complex coefficients, the neutrosophic 
polynomials do not have a unique factoring! 

If we check each solution, we get: 

P(x1) = P(x2) = P(x3) = P(%4) = 0. 

Let’s compute: 
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55 (= ay 10 n(> 1 \ a3 
tlt) SOOm De tee 
100 140 49 —~100\ 70 
=< Mer 21). 
a6 «86 36 6 6 
Reamer any, 
E26 


100 140-7 49-7 100 70-1 10-1 


Bo Ge ey he ee cee 
7:I 18/1 

6 6 
_ 1401 +491 + 701+ 101-71 +18% Ov! 
zs - a 
=7=0 


Another procedure of factoring a neutrosophic 

real polynomial equation is the following. 

Let’s have 

P(x) = (A+B-Dx*+(C+D:Dxt+(E+F-:1I=0. 

Suppose x, =a,+b,l and x, =a,+b,I are two 

neutrosophic real solutions of P(x) = 0. 

Then: 

P(x) = (A+B-I)[x — (aq, + b,D)]- [x — (@, + B21] 
=(A+B-I)x*+(C+D-I)x 
+(E+F°1). 

We multiply on the second right hand side, and 

then we identify the neutrosophic coefficients, and 
solve for a,,b,,az,andbp. 
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Research Problems. 


1. In general, how many neutrosophic solutions 
has a neutrosophic real polynomial equation of 
degreen>1? 

So far, we know that such equation of degree 1 
has none (in the case the neutrosophic division is 
undefined) or one solution (in the case the 
neutrosophic division is well defined). 

2. How many different factorings, with factors 
of first degree, are possible for a neutrosophic real 
polynomial of degree n? We got two different 
factorings for aparticular polynomial of degree 2. 

3. -— 4. Similar problems for neutrosophic 
complex polynomial equations and neutrosophic 
complex polynomials of degree n = 1. 
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Neutrosophic Random 
Numbers 


Neutrosophic Random Numbers can also be 
generated using, instead of only crisp numbers, a 
pool of sets. For example, let’s suppose one has 100 
balls and on each ball is written an interval [a,b] 
where a,b € {1,2,3,...,100}and a < b. 

When a=b we get a crisp number [a,a] =a, 
while for a < b we get a set/a, bj. 

Then randomly one extracts a ball, one registers 
its interval, then one returns it back to the pool. 
And so on. Instead of a random sequence of crisp 
numbers, we get a random sequence of intervals. 
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Example with 
Neutrosophic Data 


Let’s have the following four observations: 


6,[2, 5], 30, [18, 24]. 

The second and fourth observations are unclear, 
i.e. [2,5] means a number in this interval, but we 
don’t know which one; similarly for [18, 24]. 
Therefore we have two indeterminacies. 

In order to uniformize let’s rewrite all 
observations as intervals: 

[6, 6], [2, 5], [30, 30], [18, 24]. 

Each observations can be a_e subset, not 
necessarily a crisp number a (closed, open, half 
closed — half open) interval. 

Compute the median: 

[2,5] + [30, 30] | [2+ 30,5 + 30] 2 35 
Z 2 Pars 
= [16,17.5]. 

Therefore the medium is a number between 16 
and 17.5. 

One computes their mean: 

[6,6] + [2,5] + [30,30] + [18, 24] 

4 
-ereye 6+5+4+ 30+ 24] 
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Therefore the average is a number between 14 
and 16.25. 
Compute the deviations and square them: 
a. [6,6] — [14, 16.2] = [6 — 
16.2,6 — 14] = [10.2, —8]; 
[10.2, -8]? = [—10.2, —8] - [—10.2, —8] 
= [(-8)(-8), (-10.2) - (-10.2)] 
= [64, 104.04]. 
b. [2,5] — [14, 16.25] = 
[2 — 16.25,5 — 14] = [-14,25, -9]; 
[—14.25, —9]2 = [(—9)2, (—14.25)2] = [81, 203.0625]; 
C. [30, 30] — [14, 16.2] = 
[30 — 16.2, 30 — 14] = [13.8, 16]; 
[13.8, 16]? = [13.82, 162] = [190.44, 256]; 
d. [18, 24] — [14, 16.2] = 
[18 — 16.2, 24 — 14] = [1.8, 10]: 
[1.2, 10]? = [1.82, 102] = [3.24, 100]. 


Compute the standard deviation: 


[64, 104.04] + [81, 203.0625] + [190.44, 256] + [3.24, 100] 
4 


_ ie + 81+ 190.44 + 3.24 104.04 + 203.0625 + 256 + _ = 
a 


= ,/[84.67, 165.775625 = 
[V84.67, V165.775625| = [9.20163, 12.8754]. 
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Indeterminacy related to 
the sample size 


Suppose one has the following five observations: 


17,412,533, 9; 
but one of them is certainly wrong, yet we don’t 
know which one. 
What to do to approximate the calculations? 
Let’s first increasable reorder the observations: 
5, 8,9, 12,17, 
and then study all possibilities. 


Sample Wrong Correct Median Deviations Squared Standard 
Number Observations Observations Deviations Deviation 


33.0625 
3.0625 
1.5625 


22.5625 
3.0625 
0.5625 

$2.5625 


Florentin Smarandache 


Now we combine the five results. 
a. Interval style: 
the median belongs to the interval [8.5, 10.5]; 
the mean belongs to the interval [8.5, 11.5]; 

the standard deviation belongs to the interval [2.5, 4.43706]. 

b. Average Style: 
10.54+10.54+1004+8.5+8.5 
rs = 9.6; 
11.5+10.75+10.5+9.754+8.5 


themean = - = 102: 


themedian = 


andstandarddeviation 
c 3.5 + 4.38035 + 4.5 + 4.43706 + 2.5 


5 
~ 3.86348. 


c. Weight Average Style: 

One assigns a weight to each sample. The sample 
weight may represent the chance that the respective 
sample could be the right sample, after discarding 
the wrong observations. 

In general, the weights w,,w2,...,Wn € [0,1] such 
that 

Wy +W2++:+W, = 1. 

In the case when the sample weights are 
determined from criteria different from each other 
and therefore the sum of weights is not 1, and the 
observations are a,+a,+-°:+a,, the weight 
average is: 

W1Q4, + W2Q2 ++ + Wy An 
Wy, + Wz +++ Wy 
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In our example, if w, = 0.4,wz = 0.1,w3 = 0.3,w4 = 
0.2,w, = 0.7, then: 
theweightedaveragemedian 
_ 0.4(10.5) + 0.1(10.5) + 0.3(10.0) + 0.2(8.5) + 0.7(8.5) 
“=. - (4 oO sc02n09° °° 
=~ 9.35294; 
theweightedaveragemean 
_ 0.4(11.5) + 0.1(10.75) + 0.3(10.5) + 0.2(9.75) + 0.7(8.5) 
---—. —_ (amoiwosnoreae = - = 
~ 9.83824 
and theweightedaveragedeviation 
_ 0.4(3.5) + 0.1(4.38035) + 0.3(4.5) + 0.2(4.43706) + 0.7(2.5) 
i 0.4 + 0.1 + 0.3 + 0.2 + 0.7 
~ 3.42673. 

According to the sample weights, it’s a larger 
chance that the right sample is the fifth one. 
Therefore, the combined statistical metrics of all 
samples would be inclined to approach the fifth 
sample’s statistical metrics. 

This example can be generalized for n 
observations, such that k observations among them 
are wrong, wheren>2 and1<k<n-1. 

With a computer program, one studies each of 
the C?-* samples resulted after discarding k wrong 
observations, where C"-* means combinations of n 
elements taken in groups of n-k elements. Each 
sample has the size n-k. For each sample one 
calculates its median, mean, deviations, standard 
deviations, and of course other statistical metrics 
required by the neutrosophic problem to solve. 
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Then we combine C"-* results using either 
interval style, the average style, the weighted 
average style, or other procedures that the reader 
may design depending on the problem. 
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Neutrosophic Binomial 
Distribution 


The classical Binomial Distribution is extended 
neutrosophically. That means that there is some 
indeterminacy related to the probabilistic 
experiment. 

Suppose each trial can result in an outcome 
labeled success (S), or its mutually exclusive 
outcome labeled failure (F), or some indeterminacy 
(J). 

For example: tossing a coin on an irregular 
surface which has cracks, the coin can fall inside a 
crack on its edge, and thus one gets neither head, 
nor tale, but indeterminacy. 

We conduct a fixed number of small experiments 
(that we call trials). The outcomes of the trials are 
independent. For each trial, the chance of getting S 
is the same; similarly for the chance of getting F, or 
of getting I. 

The neutrosophic binomial random variable 
xis then defined as the number of successes when 
we perform the experiment n = 1 times. 

The neutrosophic probability distribution of x 
is also called neutrosophic binomial probability 
distribution. 

For n trials it is important the way one defines 
the indeterminacy. 
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First, it is clear that getting indeterminacy in 
each trial means indeterminacy for the whole set of 
n trials. Secondly, getting indeterminacy in no trial 
means no indeterminacy for the whole set of ntrials. 

But what about getting indeterminacy in some 
trials, and determinacy (i.e. success or failure) in 
other trials? 

This partially indeterminate and _ partially 
determinate set of n trials depends on the problem 
one needs to solve and on the expert’s point of view. 

One can define an indeterminacy threshold: 

th = number of trials whose outcome is indeterminate, 
whereth € {0, 1, 2,...,n}. 

The cases with a threshold > th will belong to the 
indeterminate part, while for a threshold < th they 
will belong to the determinate part. 

Let P(S) = the chance a particular trial results in 
a success, 

and P/(F)= the chance a particular trial results in 
a failure, for both S and F different from 
indeterminacy. 

Let P/l) = the chance a particular trial results in 
an indeterminacy. 

For x € {0,1,2,...,n}, NP (exactly x successes 
among n trials) = (7,,,1,,F,), with 
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PS): =e _ P(DEP (FY? 


Ty ~ Xx! oa 
= nl x 
Gay ie 
n— 
>) Ge waar oe 
_n! : * P(I)KP(F)"--# 
= PO) a k!(n-—x-—k)!- 
Similarly: 
n n nl th P(r) ; P(F)R-9-k 
_ SNe 5 y. Se Ne =. 
B=) Ty ae oo >, ki(n-y-—k)! 208 
y=0 y=0 k=0 
VFX VFX 


al 
In = >» zi(n—z)! ee) 


z=th+1 
| ») Ci-2P(S)* ree 
k 
| 


nN-Z Cia ake 
Ryan iP P®) / 


z=th+1 


[OP Py 
» k!(n-—z—k)! / 


k=0 
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where (}, means combinations of u elements 


taken by groups of v elements: 
ie ul 
Cu= vi(u-—v)! 

and wu! is u factorial, uw} =1-:2:3>-...:u. 

Also: 

T, = chance of x successes, and n-—~x failures 
and indeterminacies but such that the number of 
indeterminacies is less than or equal to 
indeterminacy threshold; 

F, = chance of y successes, with y#x and n— 
yfailures and indeterminacies but such that the 
number of indeterminacies is less than or equal to 
the indeterminacy threshold; 
and I, = chance of z indeterminacies, where z is 
strictly greater than the indeterminacy threshold. 

Te + Le + Fe = (P(S) + PCD + P(F))". 

In most applications, 

P(S) + P(I) + P(F) = 1, 

and this case is called complete probability. 

But for incomplete probability (where there is 
missing information): 

0 < P(S) + P(I) + P(F) <1. 

While in the paraconsistent probability (which 
has contradictory information): 

1 < P(S) + PU) + P(F) <3. 


An Example. 
Among the watches sold by a store 80% had a 
digital display and 10% an analog display. There is 
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a number of watches sold for which the storeowner 
has no evidence about their type of display, and he 
asks his manager assistant about them. Not 
knowing the manager’s previous estimations, the 
assistant estimates the unknown type of watches to 
be 20%. 

Let’s consider a neutrosophic random variable 

x = the number of watches among the next 5 
buyers that have an analog display. 

Therefore: 

P(F) = P(digitaldisplay) = 0.8, 
P(S) = P(analogdisplay) = 0.1, 
P(1) = P(indeterminacy) = 0.2. 

We got a paraconsistent neutrosophic probability 
since the information comes from the different 
sources that estimate independently. We have 
contradiction between the estimations of the 
manager and his assistant, because 

0.84+0.14+0.2 =1.1> 1. 

We have a neutrosophic binomial distribution. 

Let’s say the indeterminacy threshold is 2. 

We define the random variable X as follows: 

x = number of watches that have an analog 
display among the next 5 watches to be bought; 


“1 0.2)" (0.8)5-*-k 
ki(5—-x—k)!’ 


5! 
= x, 
Ty = (0-1) 


where x = 0,1,2,3,4,5. 
The chance that exactly 2 watches are analog, i.e. 
x = 2is: 
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= > (0.1)? , eee ee 
a = 0.0992. 
2 eZ yo (Oz »y oe aa 
= > (0.2)° 
2 = 
| | ear | (forz = 3) 


+ (0.1)*(0.8)1-# 


+7o0"-] ones 
4l L ki(1-b)! 


| (forz=4)+ 


0 
5! 0.1)*(0.8)1-* 
= 02) | eee 2 a (fore 5) 
k=0 
(0.1)°(0.8)2  (0.1)1(0.8)! (0.1) 2(0.8)° 
= 2002) io 1! 1! 210! 
(0.1)°(0.8)! (0.1)1(0.8)° 
+5 -(0.2)*: O11! 1!0! 
0 0 
+1-(0.2)>- aera = 0.07232. 


F2 can easier be computed (instead of using its 
combinatorial formula) as : 
F, = (P(S) + PU) + P(F))5 —T2 -—b 
= (0.1 + 0.2 + 0.8)° — 0.0992 — 0.07232 
= 1.43899. 
If we normalize the vector 
(T>, I>, Fy) = (0.0992, 0.07232, 1.43899) 
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by dividing each vector component by their total 
sum 

0.0992 + 0.07232 + 1.43899 = 1.61051, 

we get (72, Io, F2) = (0.061595, 0.044905, 0.893500). 

For incomplete and paraconsistent probabilities 
it doesn’t matter if we normalize at the beginning or 
at the end, we'll get the same result. 


Remark. 

Since a third component (the chance of 
indeterminacy) was added to the _ binomial 
distribution, the neutrosophic binomial distribution 
actually resembles a summation of classical 
trinomial distribution: 

(p, +it+p2)” 

where p, and pz are the probabilities that the two 
mutually exclusive events (Ff, and EF, ) occur 
respectively, while «» is the chance of getting 
indeterminacy. 

Let’s denote by A(a,f,y) the probability of 
obtaining a events F,, 8 indeterminate events, and 
y events E,, where of course 0<a,f,y <n, anda+t 
B+y =n, as results of n independent trials. 

Of course, as in classical trinomial distribution, 


one has 


n! BY 
A(a,B,y) = api Pit 2 


with n=a+fh+y. 
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We need to define what indeterminacy means 
within n trials. Let th be the indeterminacy 
threshold. For th+ 1 or more indeterminacies, we 
consider them as indeterminacy, otherwise we have 
determinacy. 

Then for x € {0, 1, 2, ...,n}, 

NP(exactly x events E, among n trials) = (T,, 1,, F,), 
where : 


Ty, = AQ Bb wt=x- PB) 


i > A(a, B,n—a—B) 
th+1sBs<n 
O<sa<n-th 


fe ». A(a, B,n-a—B) 
Osasn, A#X 
O<Bsth 
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Neutrosophic Multinomial 
Distribution 


The previous neutrosophic binomial distribution 
is generalized for the case when at each trial there 
are r(=2)_ possible outcomes and = some 
indeterminacy. 

Suppose all possible outcomes are 


E,, Eo, «., Ey 
with corresponding chances to occur 
PrP. cca. 


and some indeterminacy I with corresponding 
chance to occur 1. 

Then we have the multinomial expansion: 

(Pi + Pp +++ P. +i)” 

for n trials. 

Let’s denote similarly by A(q@,,q@,...,a,fB) the 
probability of obtaining: exactly a, events F,, a, 
events E,, ... , a, events E,, and B indeterminate 
events, 

where 0 < Qj, @,...,@,B <n 

and a,+ a,+°::+a,+f =n, 

as results of n independent trials, then 


Al iy Hija HOD) 


n! a a a . 
= ae ee 
ey! Go! cw Hy! PB 


Consider the same th as indeterminacy treshold. 
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Let the random variable x; denotes the number 
of times events E; occurs, for any j € {1,2,..,r},in n 
independent trials. 

So we have a multivariate distribution. 

Then the neutrosophic probability of obtaining 
exactly x, events F,, x, events Fp, ..., x, events E,, 
in n trials is 


(Te. XQ Xp? Ix, XQ Xp? Fy, Sete): where 


Ty, 1 en A(x, XQ sy Xry Bf) 
o<Bsth 
Ty, ate ao >. A(a@, AQ ++) Ay, Bf) 
th+1<B<n 


O<ajsn-th, for jE{1,2,...7} 


Fy, Ro-ieshes — 


> 0<P<th A(Q1,@2, ..., Ay, B). 


(Og M5 ey ETL 2 MY VG Noh) 
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Neutrosophic Scatter Plot 


ANeutrosophic Scatter Plot is a picture of 
points (x, y), such that at least a point is not well 
defined. 

For example the point (3, 5) is well defined, while 
the points ([2, 4), 7) or (-6, [O, 1]) or ({-—2,—4},3) or 
([1, 2], [5, 7]) are imprecise. 

As an example, let’s consider a sample of size 

n= 4 yielding the accompanying data: 


Neutrosophic Observation 


1) 1 2 3 4 5 6 x 
2D NEUTROSOPHIC SCATTER PLOT 
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The bivariate neutrosophic scatter plot has, 
besides points as in classical scatter plot, also 
segment of lines or parts of segments of lines, or 
surfaces, or parts of surfaces (geometrical objects of 
dimensions 1 or 2). 

In general, an n-variate neutrosophic scatter 
plot formed by n- 1 independent variables and one 
dependent variable, is composed of geometrical 
objects of dimensions O, 1, 2, ..., or n. 

A neutrosophic dependent or response 
variable is a dependent variable that has some 
indeterminacy. 

Similarly, a neutrosophic independent or 
predictor variable is a variable that has some 
indeterminacy. 

A neutrosophic function 

fy 1X2), %) = 0 
is a function depending on variables %1,X%,...,Xn 
such that the function has at least one 
indeterminate coefficient, or at least one of its 
independent variables %,,X,...,X, has some 
indeterminate value or is unknown. 

Indeterminate coefficient or indeterminate value 
can be a subset with two or more elements. 

The graph of a neutrosophic function in general 
has a higher dimension than the graph of a 
corresponding classical function (whose indeter- 
minacies have been removed). 
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For examples, the classical function f(x,y) =0 
represents a curve in the 2D-space, while the 
neutrosophic function fy(x, y) = 0 can be a surface. 

The classical function f(x,y,z) =0 represents a 
surface in 3D-space, while the mneutrosophic 
function fy(x%,y,z)=0 can represent a_ bigger 
surface or a solid. 

And in general while a classical function 

f(ki, Xa, ..., Kn) = O 
is a geometrical object of dimension d in the n- 
dimensional space, a neutrosophic function 
fy(K1, Xa, ..., Xn) = O 
is a bigger (as volume) geometrical object of 
dimension d, or a geometrical object of dimension > 
d. 

The study of a neutrosophic function becomes 
more difficult when, for example, a function’s 
coefficient or a value of one of its independent 
variables is completely unknown. 

More classical statistical formulas can be 
neutrosophically extended by replacing the 
operations on crisp numbers with operations on 
sets, that we present below. 


Let’s S,; and Sz be two sets of numbers. 

Then: 

S$, +S = {x1 + x2|x, € Syandxz € S,}(set addition) 
S1 —S = {x1 — x2|x, € S,;andxz € S,}(set substraction) 
S,° So = {x1 + x2|x, € S,andx, € S,}(set multiplication) 

a:S; =S,:a= {a+ x,4|x, € S,}(scalar multiplication) 
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a+S,=S,;+a={a+t+x,|x, € S,}(scalar set addition) 
a—S, = {a-—x,|x, € S,}(scalar set substraction) 
S$, —a = {x, —a|x, € S,}(scalar set substraction) 

S4 {= 


ele Xz © S4,X2 € S2,X2 #0 } (set division) 
2 


x2 
St = {xf |x, € S,}(set power) 


S 
: {= x E Sy,a# 0} (set scalar division) 


Sy 
a 51 = {*/x1|x, € S,} (root indexnof a set) 


a 
{= lx E $1,%1, # 0} (set scalar division) 
1 


As generalizations wehave: 
m 
> Si = (0%; |x; € S;for alli = 1,2, ..., mb}. 


— 


Similarly : 


m 
| |s = {T]j2, x; |x; € S;for alli = 1,2, ..., m}. 
i=1 
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Neutrosophic Regression 


Neutrosophic Regression is the analysis of the 
association between one or more independent 
variables and a dependent variable that are 
expressed by neutrosophic values. This association 
is usually formulated as a neutrosophic equation or 
formula, which enables prediction of future values 
of the dependent variable. 

The graph of this association is, instead of a 
curve in classical statistics, for example: 


a neutrosophic curve (we can call it a ,thick curve”, 
or ,Strip curve”), like: 
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me 


since in neutrosophic theory one deals with 
indeterminacy and approximations. 

As in classical statistics, the neutrosophic 
regression may be linear (if the association between 
independent, and dependent variables is linear), or 
non-linear (if the association is non-linear). Among 
the neutrosophic non-linear regressions of second 
degree one mentions the parabolic, elliptic, and 
hyperbolic regressions. 
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Neutrosophic Least- 
Squares Lines 


The Neutrosophic Least-Squares Lines that 
approximates the neutrosophic bivariate data 
(X11), 2, V2), 1 Xn Vn) 
has the same formula as in classical statistics 
Y=atby 

where the slope 
» xy = [ENEy)/a 

yx" = 1052)" [nl 
and the y — intercept 
a=y-—bx 

with x the neutrosophic average of x, 

and ythe neutrosophic average of y. 

One uses the circumflex accent “ above yin order 
to emphasize that f is a prediction of y. 

The only distinction from classical least-square 
line is that in neutrosophic theory we work with sets 
instead of numbers. 

Therefore, into the data, some x’s or y’s are 
imprecise, expressed by sets. The consequence is 
that «a» or« b» could result in being sets instead 
of numbers. 

Let’s see an example. 
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Neutrosophic Neutrosophic Neutrosophic 
Observation Predicted Value fi Residual yi, yi 
ane 
| 26] | | -2:1.3587, 18.7955) | (-17.7985, 24.3587) | 


es ee 16, 25 2 30 a -20.5014. 385603) | (-32.5603, 26.5014 
-21.7871,12.2073)_| (-10.2073, 23.7871 


——— 7 13 ———————— EE on 91 cece 169) -19.6443, 51.736 -41.7367, 32.6443 
pS SCT C5) TH f02, 120) | 196,225} | (-18.7871, 58.325) | (-44.325, 33.7871 


ae a -20.93, 25.3838 -20.3838, 25.93 
Se —— 2828 24. 26 Aas teal 152 i 2 a sa 3 


TABLE OF A NEUTROSOPHIC SAMPLE 


An example of calculation with sets: 
7 (13146424 4019)45 414,15) 
=(1+64+2+10+5,34+64+2+4+13+5) 
+ {14,15} = (24,29) + {14,15} 
= {(24,29) + 14, (24,29) + 15} 
= {(38, 43), (39, 44)} = (38, 44). 


Whence: 
_ (215,264) ~ [(24, 26) a 
(130,152) (22) 
(215, 264) — [=] 
~ (130,152) — eee | 


(215,264) — (152,191) (24,112) 
~ (130,152) — (96,113) (17, 56) 
= (= =) (0.42857, 6.58824) 
5671770 = 
Since 
(24, 26) 


x= = ~ (4, 4.33333) 
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and 
_ (38,44) 38 44 
6 io = (==) ~ (6.33333, 7.33333) 
6 6 6 
we get 


a = (6.33333, 7.33333) — (0.42857, 6.58824) - 
(4, 4.33333) = (6.33333, 7.33333) — (1.71428, 28.549) = 
(—22.2157,5.61905). 

Thus, the neutrosophic least-squares line is : 

§ = (—22.2157, 5.61905) + (0.42857, 6.58824)x. 

Let’s graph this «ine», which actually is a 
geometrical surface between two lines. 

If x = 0,9 = (—22.2157, 5.61905). 

If x= 1,9 = (—22.2157 + 0.42857, 5.61905 + 
6.58824) = (—21.7871, 12.2073). 

We plot these neutrosophic points, which are 
actually segments of line. 
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Neutrosophic Predicted Values are computed as 
Ji = (—22.2157,5.61905) + (0.42857, 6.58824)x;, 
fori = 1,2,...,6. 

Hence: 
¥q. = (—22.2157,5.61905) + (0.42857, 6.58824) - 2 
= (—22.2157 + 0.4285 - 2,5.61905 
+ 6.58824 « 2) = (—21.3587, 18.7955). 
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Vo = (—22.2157, 5.61905) + (0.42857, 6.58824) - [4,5] 
= (—22.2157 + 0.42857 - 4, 
5.61905 + 6.58824 : 5) 
= (—20.5014, 38.5603). 
V3 = (—22.2157, 5.61905) + (0.42857, 6.58824) «1 
= (—22.2157 + 0.42857, 5.61905 
+ 6.58824 1) = (—21.7871, 12.2073). 
Yq = (—22.2157, 5.61905) + (0.42857, 6.58824) - (6, 7) 
= (—22.2157 + 0.42857 - 6,5.61905 
+ 6.588247) = (—19.6443, 51.7367). 
Ve = (—22.2157, 5.61905) + (0.42857, 6.58824) - (8) 
= (—22.2157 + 0.42857 - 8,5.61905 
+ 6.58824: 8) = (—18.7871, 58.325). 
Ve = (—22.2157, 5.61905) + (0.42857, 6.58824) «3 
= (—22.2157 + 0.42857 - 3,5.61905 
+ 6.58824: 3) = (—20.93, 25.3838). 
The Neutrosophic Residuals are computed in 
the same way as in classical statistics: 
V1 — Vr V2 — 92, Yn = Vn 
where y; are the real values of variable y, 
and V, are respectively their predicted values. 
The neutrosophic residuals are: 
¥1—-¥i = [1,3]—[(22.2157, 5.61905) 
+ (0.42857, 6.58824) - 2] 
= [1,3] — (21.3587, 18.7955) 
= (1 — 18.7955, 3 — (—21.3587)) 
= (—17.7955, 24.3587). 
Yo — Va = 6 — [(—22.2157, 5.61905) + (0.42857, 6.58824) 
-[4,5]) = 6 — (—20.5014, 38.5603) 
= (—32.5603, 26.5014) 
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V3 — 3 = 2—[(—22.2157, 5.61905) + (0.42857, 6.58824) 
-1] = 2—(-21.7871, 12.2073) 
= (—10.2073, 23.7871). 

V4 — Va = (10, 13) — [(—22.2157, 5.61905) + 
(0.42857, 6.58824) - (6,7)] = (10, 13) — 

(—19.6443, 51.7367) = (—41.7367, 32.6443). 

Ys — Ve = {14,15} — [(—22.2157, 5.61905) + 
(0.42857, 6.58824) - 8] = {14, 15} — (18.7871, 58.325) = 
(—44.325, 33.7871). 

V6 — Ve = 5 — [(—22.2157, 5.61905) + (0.42857, 6.58824) 
-3] = 5 — (—20.93, 25.3838) 
= (—20.3838, 25.93). 

It is remarkable to observe that each real value 
of belongs to or it is included in the predicted value 
interval: 

yz = [1,3] ¢ (—21.3587, 18.3955); 

Yo = 6 € (—20.5014, 38.5603); 

y3 = 2 € (—21.7871, 12.2073); 

y, = (10,13) € (—19.6643, 51.7367); 

ys = {14,15} c (—18.7871, 58.325); 
V6 = 5 € (—20.93, 25.3838). 


Deneutrosofications. 


a. Another idea of solving this problem would 
be to transform the neutrosophic data in classical 
data, either taking the midpoint of each set, or the 
average of a discrete set of the form {...}. Or taking 
small neighborhoods centered in the midpoints of 
each set. Or taking the minimum values of the sets 
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and thus constructing multiple classical data. Then 
one computes the least-squares line for each data. 
Afterwards one makes the average of the results, or 
one considers the min/max interval of the results. 

b. Or one transforms the neutrosophic least- 
squares line into a classical least-square line by 
replacing the set representations of the coefficients 
«avn and «b» by their corresponding midpoint, or 
(depending on the application) by other interior 
points of the two sets. In our previous example, 

¥ = (—22.2157, 5.61905) + (0.42857, 6.58824) -« x 

becomes 


Y=-—8+4 3.5x, 
where —8 is close to the mipoint of 
(—22.2157,5.61905), 
and 3.5 is close to the midpoint of 


(0.42857, 6.58824). 

c. One could take the midpoints of the 
neutrosophic predicted values  neutrosophic 
residuals, or initial neutrosphic data; or smaller 
neighborhoods centered in the midpoints; or min 
values and max values separately and obtaining 
multiple classical data and calculating the needed 
statistical characteristic for each of them, then 
averaging the results. 

Let’s compute the midpoints of neutrosophic 
predicted values and neutrosophic residuals: 
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Neutrosophic Neutrosophic 


Predicted Value Residual Midpoint 
Midpoint 
-1.2816 3.2801 
9.0295 -3.0295 
-4.7899 6.7899 
16.0467 -4.5462 
19.7690 -5.2690 
2.2269 2.773 | 
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Neutrosophic Coefficient 
of Determination 


We compute the Neutrosophic Residual Sum of 
Squares, denoted by NSSResid, given by: 


NSSResid = ».0- 9 = >” -a) y-b) xy 


and the Neutrosophic Total Sum of Squares, 
denoted by 

2 

NSSTo = »o —yy= >” _ Qyy 


n 
The Neutrosophic Coefficient of 


Determination, denoted by rj, is : 

4 NSSResid 

rf =1-—____, 

NSSTo 
and represents the proportion of variation in y, 
when considering a linear relationship between 
variablesx and y. 
NSSResid = 3.2801? + (3.0295)? + 6.78997 
+ (—4.5462)? + (—5.2690)? + (2.7731)? 


= 122.16; 
2 38, 44)2 
NSSTo = >» _ Co" = (362,468) — oe 
= (362, 468) eas 
7 d 6’ 6 


= (362, 468) — (40.1111, 53.7778) 
= (362 — 53.7778, 468 — 40.1111) 
= (308.222, 427.889). 
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Whence 
Poe Pe 122-16 oe (= °16 122.16 ) 
Nv ~ (308.222, 327.889) 327.889’ 308.222 
= 1 — (0.3726, 0.3963) 
= (1 — 0.3963, 1 — 0.3726) 
= (0.6037, 0.6274). 

So between 60.37% and 62.74% of the sample 
variation is explained by the neutrosophic 
approximate linear relationship between x and y. 

The Neutrosophic Correlation Coefficient or the 
product moment neutrosophic coefficient Ty 
(extension of Pearson’s correlation coefficient from 
crisp data to neutrosophic data), has the same 
formula as in classical statistics, but we work with 
sets instead of numbers: 


NIV DAD 


** nea OR =O 
or 
au Sxy 
HF eae 


where 5S,,, is the neutrosophic covariance of x — 
and y-—values, and S,,S, are the neutrosophic 
sample standard deviations. 

Let’s consider the example from the previous 
Table of Neutrosophic Sample of size 6. 
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r, 
. 6 + (215, 264) — (24, 26) - (38, 44) 
V6 * (130, 152) — (24, 26)? - [6 « (362, 368) — (38, 44)?] 
(6-215,6: 264) — (24: 38,26: 44) 
[(6- 130,6- 152) — (242, 262)] - [(6 - 362,6- 468) — (382, 442)] 
(1290, 1584) — (912, 1144) 


[(780, 912) — (576, 676)] - [(2172, 2808) — (1444, 1936) | 
(1290 — 1144, 1584 — 912) 


(780 — 676,912 — 576) - (2172 — 1936, 2808 — 1444) 


(146,672) 7 (146,672) 


/ (104, 336) « (236,1364) (104: 336,336: 1364 
(146, 672) _ (146,672) 
~ (¥34944, 458304) (186.933, 676.982) 
146 672 _ 

= (srs eas} ~ (0.2157, 3.5949) = (0.2157, 1]. 

In general ry is a subset of the interval [—1,1]. If 
ry is a subset of [0,1] then the points (x;, y;) for i = 
1,2,...,n, lie approximatively near a straight line of 
positive slope, while when ryis a subset centered 
or almost centered at O (or ry is nearly half in [0, 1] 
and nearly half in [—1,0] then their is virtually no 
linear approximation but their may be a non-linear 
association between the points. 


Neutrosophic Random Numbers is a sequence 
of numbers and indeterminacies occurring at 
random with equal probability. 

The occurrence of a number or indeterminacy is 
not a guide to the numbers or indeterminacies 
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that follow it, nor is it predicted from the numbers 
or indeterminacies that precede it. 

Using eleven balls numbered O to 9 and another 
one ball that has its number erased (which one 
cannot read, that we note by J), then repeatedly 
withdrawing a ball and putting it back to the 
container. 

We randomly generate the sequence: 

2,9; 9;1,0; 7,6; 27, LB ng 

where I = indeterminacy. 

The computers can be enabled to generate 
neutrosophic random numbers using the same 
classical algorithms as for classical random 
numbers, but adding one or more states of 
indeterminacies with an equal chance of occurring 
each of them. 

As a generalization we proposed the 
Neutrosophic Weighted Random Numbers, 
where each number x; has a different chance p; to 
occur, and each indeterminacy J; has a different 
chance 7; to occur. 

There are also cases when the numbers have to 
be in a given set; for example, each number should 
have k digits. 
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A Neutrosophic Normal 
Distribution 


A Neutrosophic Normal Distribution of a 
continuous variable X is a classical normal 
distribution of x, but such that its mean 1p or its 
standard deviation o (or varianceo’), or both, are 
imprecise. 

For example, p, or o, or both can be set(s) with 
two or more elements. The most common such 
distributions are when y, o, or both are intervals. 

The neutrosophic frequency function formula is 
the same, except, as explained in the introduction, 
replacing p by pn and o by on: 

1 x — Uy)” 
Xy~Nw (By ON) = Pl raid (- | 
where Xy actually means that variable X may be 
neutrosophic (i.e. having some indeterminacy), and 
similarly Ny (.,.) meaning that the normal 
distribution N(.,.) may be neutrosophic (i.e. having 
some indeterminacy). 

Instead of one bell-shaped curve, we may have 
two or more bell-shaped curves that have common 
and uncommon regions between them and are 
above the x-axis. Each one is symmetric with 
respect to the vertical line passing through the 
mean (X = jj). 
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Neutrosophic Probability Density 


As a first neutrosophic example for normal 
distribution, let’s consider a normal distribution 
with p = 15 and o = [2, 3]. Thus the standard 
deviation is indeterminate. 


Ny, (15, 27) 


Ny, (15,07),2 <0 <3 


Ny, (15, 37) 


Within one standard deviation of the mean 
translates in this first example by: 
pt+o=15+[2,3] =[15 —3,15 +3] = [12,18], 
or approximately 68% of values lie between 
x € [12,18]. 
Within two standard deviations of the mean 
translates by: 
pt2o0 =1542: [2,3] = 15+ [4,6] = [15—-—6,15 +6] 
= [9,21], 
or approximately 95,4% of values lie between 
x € [9,21]. 
We could also compute the last interval as: 
[12,18] +o = [12,18] + [2,3] = [12 — 3,184 3] 
= [9,21]. 
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For three standard deviations: 
p+3o0 =15+3-[2,3] =15+[6,9] =[15-9,154 9] 
= [6,24], 
or we could compute it as 
[9,21] + [2,3] = [9 — 3,21 + 3] = [6,24], 
and approximately 97,7% of values lie between 
x € [6, 24]. 

The area between the lowest and the highest 
curve for each portion represents the burden 
(indeterminacy) of the graph. 

The neutrosophic normal distribution can be 
regarded as a bell-shape curve with heavy margins. 

A random variable X that has a neutrosophic 
normal distribution is called a neutrosophic 
normalvariable. 

A second neutrosophic examplefor normal 
distributionwhere p = [15,17] and o = 2, hence now 
p is indeterminate. 


Neutrosophic Probability Density 


Similar discussion for the second example: 
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Within one standard deviation, i.e. 
uwto = [15,17] +2 = [15 —2,17 + 2] = [13,19], 
approximately 68% of values lie between x € 
[13, 19]. 
Within two standard deviations, i.e. 
wt+2o0 = [15,17] +2-2 = [15,17] +4 = [15 —4,17 + 4] 
= [11,21], 
or computed as 
[13,19] +o = [13,19] + 2 = [13 — 2,19 + 2] = [11, 21]. 
And within three standard deviations, i.e. 
u+3o = [15,17] +3-2 =[15,17]) +6 =[15-6,17+ 6] 
= [9,23], 
or computed as 
[11,21] +2 = [11 — 2,21 + 2] +2 = [9, 23], 
approximately 97.7% of values lie between 
x € [9, 23]. 

A third neutrosophic example of normal 
distribution with yw = [15,17] and o = [2,3], hence 
double indeterminacy, combines the previous 
second graph with the first one. 

Of course, the vagueness becomes wider! 

With p = [15,17] and o = [2,3], we get: 

Within one standard deviation of the mean, i.e. 

uto = [15,17] + [2,3] = [15 — 3,17 + 3] = [12, 20], 
approximately 68% of values lie between 
x € [12,20]. 
Within two standard deviations of the mean, i.e. 
u+2o0 = [15,17] + 2-[2,3] = [15,17] + [4,6] 
= [15 — 6,17 + 6] = [9,23], 
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or computed as [12, 20] + [2,3] = [12 —3,20+ 
3] = [9, 23], 

approximately 95.4% of values lie between x € 
[9,23]. 

And within three standard deviations of the 
mean, i.e. 

u+ 36 = [15,17] +3 [2,3] = [15,17] + [6,9] 
= [15 -—9,17+9] = [6, 26], 
or computed as [9,23]+[2,3] =[9-3,234+3]= 
[6, 26], 
approximately 97.7% of values lie between 
x € [6, 26]. 


Neutrosophication of Other Distributions. 


In the same way, replacing one or more 
distribution parameters by a set, we can extend the 
classical distributions, such as: standard normal 
distribution, bivariate normal distribution, uniform 
distribution, sampling distribution, geometric 
distribution, hypergeometric distribution, Poisson 
distribution, chi-squared distribution, exponential 
distribution, frequency distribution, Pareto 
distribution, t-distribution, etc. to their 
corresponding neutrosophic versions. 

The set replacing a crisp parameter may have two 
or more elements, or may be empty (the last case 
meaning that the parameter is unknown). 
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A Neutrosophic 
Hypothesis 


A Neutrosophic Hypothesis is a statement about 
the neutrosophic values of a single or several 
population characteristics. 

The distinction between the classical (statistics) 
hypothesis and neutrosophic hypothesis is that in 
the neutrosophic statistics the variables that 
describe the population characteristics are 
neutrosophic (i.e. they have some indeterminate 
values, or several unknown values, or an inexact 
number of terms if the variable is discrete), or for 
the values that we compare at least one of the 
population characteristics is neutrosophic (i.e. 
indeterminate or unclear or vague value). 

Similarly to the classical statistics, a 
Neutrosophic Null Hypothesis, denoted by NHo, is 
the statement that is initially assumed to be true. 
While the Neutrosophic Alternative Hypothesis, 
denoted by NHa, is the other hypothesis. 

In carrying out a test of NHo versus NH, there are 
two possible conclusions: reject NHo (if sample 
evidence suggest strongly that NHo is false), or fail 
to reject NHo (if the sample does not support string 
evidence against NHo). 

Examples: 

NHo: 1 € [90, 100] 
NH,:p < 90 
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NH: > 100 
NH,:p € [90, 100], 
where p represents the classical average IQ of all 
children born since 1st January 2001. 


NHo: 7m = 0.2 or 0.3 
NHq:tt < 0.2 
NHqitt > 0.3 
NH: 1 € (0.2, 0.3) 
NH,:p € {0.2, 0.3}, 
where rm represents the classical proportion of all 
Ford cars that need repair while under first year of 
warranty. 


NH: p < 0.1 orp > 0.9 
NHa:p = 0.1 
NHa:p = 0.9 
NHo: p > 0.1 and p < 0.9 
NHa:p € [0.1, 0.9], 
where p represents the classical proportion of 
outliers in a human population with respect to their 
height, i.e. percentage of people whose height is less 
than 150 cm, or percentage of people whose height 
is greater than 190 cm. 
Neutrosophic Outliers are noticeably unusual 
values in the neutrosophic data; they can be crisp 
values or neutrosophic values. 


No? (ee el [0.45, 0.55], 


which is equivalent to 
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Pin > 9-49 
and 

Pax > 9:99 
where pi represents a neutrosophic percentage 
average of all electronic devices that get morally 
depreciated after three years from their fabrication; 
[Minin max! 3S a meutrosophic value (rough 
approximation). 
= 0.45 
= 0.55 


ax 


1 < 0.45 
< 0.55 


max 


< 0.45 orp < 0.45. 


max 


NHq 
NHg 
NHg 
NH, 
NHg:p 


‘Pmin 
sg lee 
olen 
i 
min 


NHo: u = 7.0 
NHq: fh < 7.0 
NHq: pf > 7.0 
NHa: fb # 7.0 
A manufacturing plant made an approximate 
survey of its selling, survey done by two 
independent observes on different samples of same 
size. Their findings are close, yet different. The 
owner of the manufacturing plant decided to put 
both results together, taking for each period the 
[min, max] or [inf, sup] interval, in order to see the 
fluctuation of sales. The variable x that describes 
the survey is thus a neutrosophic one: 
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Period Sold Quantity (in thousands) 
[4, 6] 

2001 

2002 [7,8] 

2003 5.) OF 6:0 

2004 (8.0, 8.8) 

2005 (OS) 


The null hypothesis that the average annual 
selling p = 7.0 is in the classical style, but the 
variable x that p is referring to is neutrosophic. 

So we still have a neutrosophic hypothesis. 


Neutrosophic Hypothesis Testing Errors. 


A census of a large population is hard or even 
impossible to due. That’s why we have to use 
samples. The inference we are making from a 
neutrosophic sample characteristic to a population 
characteristic is subject to error. 

Similarly to classical statistics, there are two 
types of errors: 

1. Neutrosophic Type I Error, which is the error 
of rejecting NHo when NHp is true. 

2. Neutrosophic Type II Error, which is the 
opposite of the previous error, i.e. the error of not 
rejecting NHo when NH is false. 
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No matter what test we do, there is some chance 
that a neutrosophic type I error will be made, and 
there is some chance that a neutrosophic type II 
error will be made too. 

For example, rejecting the hypothesis Ho: p = 7.0 
when it is true in one of the previous examples, 
would determine the owner of the manufacturing 
plant to take additional adjustments and spending 
money when not really needed. 

While accepting Ho: pp = 7.0 when it is false, will 
damage the future selling. 

Probabilities of neutrosophic type I error and type 
II error are denoted by an (level of significance) and 
respectively Bn. 

Dealing with neutrosophic probabilities, an and 
Bn can be subsets of the interval [O, 1]. The ideal test 
procedure would havean = By= 0, or anand Bnas tiny 
intervals near zero. 

For example, ifan= [0.07,0.10]in a test procedure, 
done with different samples, over and over, a true 
hypothesis Ho is rejected about 7, 8, 9, or 10 times 
in a hundred. 

If By = [0.07,0.10],then a false hypothesis Ho is 
accepted about 7-10 times in a hundred. 


Example. 

A car manufacturer pretends that between 80% 
and 90% of its car need no repair during the first 2 
years of driving. In order to check the claim, a 
consumer agency obtains a random sample of 50 
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purchasers and investigate them whether or not 
their cars needed repair during the first 2 years of 
driving. Let p denote the sample proportion of 
responses that indicate no repair, and let m denote 
the true proportion of no repairs (called successes). 
The appropriate neutrosophic hypotheses are: 
NHo:1 € [0.8,0.9] versus NHg:7 < 0.8 

in order to check if the sample evidence suggests 
that mz < 0.9. 

Neutrosophic Type I Error is to consider the car 
manufacturer’s claim fallacious (i.e. m < 0.8) while 
in fact it is correct. 

And Neutrosophic Type II Error if the consumer 
agency fails to detect the manufacturer’s incorrect 
claim. 

For avoiding serious consequences the consumer 
agency decides a type I error probability of 
[0.01, 0.05] but no larger can be tolerated. So 

a =[0.01,0.05] is used for developing a test 
procedure. 

We recall, from classical statistics, that a 
classical standard normal distribution of a random 
variable z, is a normal distribution with the mean 
value 

u=0 
and standard deviation 
o=1. 

Its corresponding curve is called standard 
normal curve or z curve. 
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A z critical value captures the lower-tail or 
upper-tail area, or the central area under the z 


curve. 
The table of the most used Z critical values in 


classical statistics: 


Critical Area to the Area to the Area 
value, z right of z left of -z between -z 
and z 
28 “0 a) .80 
1.645 O05 .O5 .90 
1.96 1025 .025 95 
pas .O1 .O1 .98 
2.56 .005 .005 .99 
3.09 .001 .001 .998 
3.29 .0005 .0O005 .999 


A normally distributed random variable x can 
bestandardized as 


where p = x's mean value, 
and o = x's standard distribution. 
If the neutrosophic null hypothesis about 
variable x is: 
NHo:p € [a, b], 
where [a,b], with a<b, is the hypothesized 
interval, then the neutrosophic test statistic is: 
x —[a.b] 
oe a 
s/Vn 
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where x is the sample mean, 
sis the sample standard deviation, 
and nis the sample size, with n > 30. 


Variable z has approximately a neutrosophic 
standard normal distribution. 

In neutrosophic statitics, x, s and even ncan be 
sets (not necessarily crisp numbers). 


Alternative Hypotheses. 

Ha: >b; Reject Ho if minz>z critical value 
(upper-tailed test); 

Ha:u<a; Reject Hy if maxz < —z critical value 
(lower-tailed test); 

H,:u¢€ [a,b]; Reject Hp if: either minz > z critical 
value, or maxz < —zcritical value (two-tailed test). 


Example. 

Let’s consider the exam-anxiety scores for a 
sample of an American College students were the 
following: 

n = 64,x = [48.0,50.0], and s = 25. 

Then p = true mean exam-anxiety. 

Ho: € [40.0, 41.0] 
Hail > 41.0. 
The neutrosophic test statistics is: 
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[48.0, 50.0] — [40.0,41.0] [48.0 — 41.0,50.0 — 40.0] 
a 


25/V64 25/8 
_ [7.0,10.0] _ 8-[7.0,10.0] _ [56.0, 80.0] 
25/8 25 - 25 
756.0 80.0) _ 
=\5e oe |= [2.24, 3.20]. 


For a=0.10 the corresponding one-tailed z 
critical value from the previous table is 1.28. Hence 
Ho is rejected because z = [2.24,3.20] > 1.28. 

In conclusion, the mean exam-anxiety score is 
higher than 41.0. 
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The Neutrosophic Level of 
Significance 


The Neutrosophic Level of Significance a may 
be a set, not necessarily a crisp number as in 
classical statistics. 

For example, a, = [0.01,0.10]is a neutrosophic 
level of significance a, where a varies in the interval 
[0.01, 0.10]. 

A Neutrosophic P-Value is defined in the same 
way as in classical statistics: the smallest level of 
significance at which a null hypothesis Hj) can be 
rejected. 

The distinction between classical P-value and 
neutrosophic P-value is that the neutrosophic P- 
value is not acrisp number as in classical statistics, 
but a set (in many applications it is an interval). 

Neutrosophic P-Value = P(z > critical value, 
whenHpis true), where P(:) means _ classical 
probability calculated assuming that Hp) is true, 
probability of observing a test statistic value being 
more extreme than is was actually obtained. 

Suppose one has calculated the neutrosophic P- 
value at the particular level of significance a, where 
a is a crisp positive number. 

1. If max{neutrosophicP — value} < a, then reject 
Hy at level a. 

2. If min{neutrosophicP — value} > a, then do not 
reject Hy, at level a. 
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3. If min{neutrosophicP — value}<a< 
max{neutrosophicP — value} then there is an 
indeterminacy. Thus 

a — min{neutrosophicP — value} 
max{neutrosophicP — value} — min{neutrosophicP — value} 

is the chance of rejecting Hp at level a, 

and 

max{neutrosophicP — value}—a 
max{neutrosophicP — value} — min{neutrosophicP — value} 

is the chance of not rejecting Hg at level a. 

Let ay be a set. 

4. If max{neutrosophicP — value} < min{ay}, then 
reject Hy at level ay. 

5. If min{neutrosophicP — value} > max{ay}, then 
do not reject Hy at level ay. 

6. If the two sets, those of the neutrosophic P- 
value and of the neutrosophic level of significance 
ay intersect, one has indeterminacy. And one can 
compute the chance of rejecting Hy at level ay, and 
the chance of not rejecting Hp at level ay. 

In classical statistics, the P-value is computed 
considering theTable of Standard Normal 
Probabilities. 

a. P-value is the area under the z curve to the 
right of computed z, for Upper-tailed z test. 

b. P-value is the area under the z-curve to the 
left of computed z, forLower-tailed z test. 

c. P-value is twice the area captured in the tail 
corresponding to the computed 2, for Two-tailed z 
test. 
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Let’s insert from the classical statistics the 
Standard Normal Cumulative Probability Table [for 
positive z-values only, since this is needed in our 
below example]: 


Standard Normal Cumulative Probability Table ra 


Cumulative probabilities for POSITIVE z-values are shown in the following table: —— t = 


0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 


In the previous example, 
Ho -we[40.0, 41.0] versus Hg: p > 41.0, 
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we found the neutrosophic z = [2.24,3.20]. We have 
an Upper-tailed z test. 

From the above Table of Standard Normal 
Probabilities, the area under the z curve to the right 
ofz, = 2.24 is 1 — 0.9875 = 0.0125 

while for 

Zz = 3.20 is 1 — 0.9993 = 0.0007. 

Thus, the neutrosophic 

P — value = [0.0007, 0.0125]. 

At the level of significance a, = 0.10, reject Ho 

since 
max[0.0007,0.0125] = 0.0125 < 0.10. 

At the level of significance az = 0.0005, do not 

reject Hy since 
max|0.0007, 0.0125] = 0.0125 > 0.0005. 

At the level of significance a3; = 0.01, one has 
indeterminacy since 

0.01 € [0.0007, 0.0125]; therefore: 

chance of rejecting Hp at level a3 = 0.01 is 

0.01—0.0007 — 0.0093 


0.0125 —0.0007 0.0118 
and chance of not rejecting Hp at level a3 = 0.01 


= 79% 


is 
0.0125 —0.01 = 0.0025 


a DOG: 
0.0125—0.07 0.0118 A 
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The Neutrosophic 
Confidence Interval 


The Neutrosophic Confidence Interval for a 
population characteristics is defined, similarly to 
the classical statistics, as an interval of plausible 
neutrosophic values of the characteristic. 

The neutrosophic value of the characteristic is 
captured inside the interval with a chosen degree of 
confidence. 

A confidence level is associated with each 
neutrosophic confidence interval, as in classical 
statistics. It tells us how much confidence we have 
in procedure used in constructing the neutrosophic 
confidence interval. 

The classical formulas for the confidence interval 
are extended from crisp variables to neutrosophic 
variables (i.e. variables whose values are sets): 

1. When the neutrosophic value of the 
population standard deviation o is known, the 
Large-Sample Neutrosophic Confidence Interval 
for the Population Mean p is: 


o 

xX + (zcritical value) - a 

where x is the large-sample neutrosophic mean, 
and n is the neutrosophic size of the large-sample. 

Thereforex, 0, and/or n may be sets instead of 


crisp numbers. 
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2. When the neutrosophic value of the 
population standard deviation o is unknown (as in 
most practical applications), and the sample size 
exceeds 30, one uses the sample standard deviation 
s instead of o for computing the Neutrosophic 
Confidence Interval for the Population Mean rp: 

xX + (Zcritical value) ae 

For both formulas, the Zz critical value 1.645 
corresponds to the confidence level of 90%, the z 
critical value 1.96 corresponds to the confidence 
level of 95%, and the z critical value 2,58 
corresponds to the confidence level of 99%, 
similarly as in classical statistics. 

The confidence level of, for example, 90% does 
not refer to the chance that the population mean p 
is captured in an interval, but to the percentage of 
all possible successful samples (i.e. samples for 
which p is included in the confidence interval). 

An Example. 

Many individuals partially loose vision because 
of exposure to dust. 

On a study involving 60 people (a sample), that 
were constantly exposed to dust to _ their 
construction work places, in average they lost 
18%-20% of their vision accuracy, with a sample 
standard deviation of 4%-5%. 

The study investigator wishes a 90% confidence 
interval for p. Hence: 

x = [18, 20] 
zcritical value = 1.645 
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s = [4,5] 
n = 60. 
Therefore, the neutrosophic confidence interval 
for the population mean 1p is: 
[4,5] 
¥60 


= [18,20] + 


[18, 20] + (1.645) - 


1.645(4) 1.645(5) 
v60  +=—-V60 
= [18,20] + [0.85, 1.06]. 
Let’s split into two parts: 
[18, 20] + [0.85, 1.06] = [18 + 0.85, 20 + 1.06] 
= [18.85, 21.06], 
and 
[18, 20] — [0.85, 1.06] = [18 — 1.06, 20 — 0.85] 
= [16.94, 19.15]. 
Combining these two cases we get the 
neutrosophic confidence interval: 
[16.94, 21.06]. 
The Neutrosophic Sample Size to estimate, 
within the amount B, with c% confidence, of the 


population mean 1p is: 
(zcritical value):a 
i ar 

where z critical value should correspond to the 
c% confidence, 

o is the population standard variation, 

and ny is the resulting neutrosophic sample size, 
hence ny may be a set (especially an interval). 

For surety, we can take the sample size as 
[max{ny}], where | ] means superior integer part. 

Let’s see an Example. 
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The business department wishes to estimate the 
annual cost of office supplies for faculty at the 
University of New Mexico to be within $40 of the 
true population mean. The business department 
wants a 95% confidence in their result accuracy. 

How large should the sample be? 

Because o is not known, it can be approximated 


as 
_ Tange 
an 

as in classical statistics. 

Range is the difference between the highest and 
lowest costs. 

The amount spent on office supplies varied 
between $500-$550 to $100-$150. Then 

[500,550] — [100,150] [500 — 150,550 — 100] 


SS ———— es 
~ — 


4 4 
[350,450] 7350 450 
a har 
= [87.50, 137.50]. 

Further, B = 40, z critical value is 1.96, and: 
1.96[87.50, 137.50]]* —_71.96(87.50) 1.96(137.50)y7 
40 | 7 | 40 ’ 40 
= [4.2875, 6.7375]2 = [4.28752, 6.73757] 

~ [18.38, 45.39]. 


ny = 


Now 
[max[18.38, 45.39]] = [45.39] = 46. 
Therefore the sample size should be 46. 
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Large-Sample 
Neutrosophic Confidence 
Interval for the 
Population Proportion 


Using the classical statistics one can define (in 
the same way) the Large-Sample Neutrosophic 
Confidence Interval for the Population 
Proportion m: 


i= 
p + (zcritical value) - pe?) 


for the case when min{np}=>5 and min{n: (1 —- 
p)} 25, 

where 

p = sample proportion = number of sample 
individuals that possess the property of interest 
divided by sample’s size; 

n = sample’s size; 


It = population proportion = 
number of population individuals that possess the property of interest 


total number of population individuals , 


with the distinction from the classical statistics 


that in neutrosophic statistics the parameters p and 
n may be setsinstead of crisp numbers, and the z 
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critical value may be a set as well(for example it may 
be [1.645, 1.96], i.e. confidence level of [90, 95]%). 
The neutrosophic sample statistics p, for min{n} 
large enough, has a neutrosophic sampling 
distribution (normal curve) that approximates the 
population mean m and its standard deviation 


m(1-70) 
naa Pk 


Let’s see an Example. 

A survey on a sample of 200 — 220 consumers is 
done at a car dealer asking the following question: 
“Would you be willing to trade in your old car when 
buying a new car?” The number of yes’s was 150. 
The confidence level should be 99%. If m denotes the 
proportion of all consumers who would trade in 
their old cars, one may consider p a point estimate 
for I: 

150 150 150 

P (200, 201,...,220} — 550° 200 

The sample’s size {200,201,...,220} means that 
the surveyer was not sure about 20 people if they 
were or not custumers of this car dealer. So, the 
sample’s size is indeterminate (approximated by the 
set {200, 201, ...,220}), 

z critical value = 2.58. 

min{np} = min{{200, 201, ...,220} - [0.68, 0.75]} 
= 200(0.68) = 136 >5; 


= [0.68, 0.75]. 
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min{n(1 — p)} = min{{200, 201, ...220} 
-(1 — [0.68,0.75])} 
= 200: min([1 — 0.75, 1 — 0.68]) 
= 200 - min([0.25, 0.32]) = 200(0.25) 
=/50°> 5. 
The large-sample neutrosophic confidence 
interval for ris: 


[0.68, 0.75] - (1 — [0.68, 0.75]) 
{200, 201, ..., 220} 


= [0.68, 0.75] + 2.58 


[0.68, 0.75] + 2.58: 


[0.68, 0.75] - [0.25, 0.32] 
{200, 201, ..., 220} 


= [0.68, 0.75] + 2.58 


0.68(0.25) 0.75(0.32) 
220 ’ 200 


= [0.68, 0.75] + 2.58 


- /[0.000773, 0.001200] 
= [0.68, 0.75] + 2.58 
-¥0.000773, V0.001200 
= [0.68,0.75] + 2.58 
- [0.027803, 0.034641] 
= [0.68, 0.75] + [0.071732, 0.089374]. 
Split it into two parts: 
[0.68, 0.75] + [0.71732, 0.089374] 
= [0.751732, 0.839374], 
and 
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(0.68, 0.75] — [0.071732, 0.089374] 
= [0.68 — 0.089374, 0.75 — 0.071732] 
= [0.590626, 0.678268]. 

Combining both results in a conservative mode, 
we get: 

[0.590626, 0.839374]. 

The formula for choosing the neutrosophic 
sample size is the same as in classical statistics, but 
using sets instead of crisp numbers: 

zcritical value]? 
| B 

where B = the specific error bound. 

If m cannot be estimated using prior neutrosophic 
information, one uses m = 0.5 which gives a 
conservatively large sample value (i.e. a larger n 
than any other value of m would do). 


n=n(1-T 
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The Neutrosophic Central 
Limit Theorem 


The Neutrosophic Central Limit Theorem, which 
is an extension of the classical Central Limit 
Theorem, can be safely applied if min{n} exceeds 30, 
where n is the neutrosophic sample size (i.e. n may 
be a set). 

The Neutrosophic Central Limit Theorem states 
that the neutrosophic sampling distribution of x si 
approximated by a neutrosophic normal curve 
when min{n} is sufficiently large, no matter how is 
the population distribution. 

Of course, if the population distribution is 
normal, then min{n} may be smaller than 30, and 
the neutrosophic sampling distribution of x is 
normal too for any neutrosophic sample size n. But, 
if the population distribution is not normal, then 
min{n} should be greater than 30, and the 
neutrosophic sampling distribution of x is only an 
approximation to the normal curve: the larger is 
min{n}, the better approach. 


The last result has enabled the neutrosophic 
statisticians in order to infer a population mean, to 
develop large sample neutrosophic procedures even 
when one deals with an unknown shape of the 
population distribution. 
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Using similar notations: 

n = random neutrosophic sample size; 

x = neutrosophic mean of the sample size; 

i = population mean; 

o = population standard deviation; 

[iz = neutrosophic mean of the x distribution; 

and 

Oz = neutrosophic standard deviation of the x 
distribution; 

one has, as in classical statistics: 

Uz = H, 


and o; = — 
x ~~ In" 


The neutrosophic central limit theorem does not 
apply, as in classical statistics, when min{n} is 
small and the shape of the population distribution 
is unknown. 


Let’s introduce the Small-Sample Neutrosophic 
t Confidence Interval for the Mean of the Normal 
Population, which is just a neutrosophication of 
the classical one-sample t confidence interval for 
the population mean p: 


s 
xX + (tcritical values) - — 
7 vn 


where similarly: 
x = neutrosophic sample mean; 
s =neutrosophic sample standard deviation; 
n = neutrosophic sample size; 
and 
tcritical value is based on 
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min{n} — 1 degrees of freedom (df). 
x,s,andn may be sets instead of crisp numbers. 
For small min{n}, the neutrosophic t confidence 
interval for the population mean p is appropriate 
when the population distribution is normal or 
approximately normal. Otherwise, another method 
should be employed. 


The neutrosophic t distribution is more spread 
out, of course, than the neutrosophic standard 
normal (z) curve, because the use ofs, instead of 
population deviation 0, produces extra variability. 


The neutrosophic t distributions are 
distinguished from one another by the degree of 
freedom, which can be a positive integer greater 
than or equal to 
1, or a set of positive integers greater than or equal to 1, 
for example: 

{n,n+1,...,n +m}. 

The higher is min{n}, the closer the neutrosophic 
t distribution is to the neutrosophic z curve. For 
min{n} > 120 one may use the z critical values. The 
neutrosophic tcurve, for a fixed number of degrees 
of freedom, is in general bell-shaped and centered 
at zero in neutrosophic style way. 


An Example. 
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A small random sample of 18 workers, at the Rail 
Road, was investigated regarding the weights these 
workers are able to lift in their work place. The 
neutrosophic sample average found was x between 
8 kg and 10 kg, with a standard deviation s between 
3-4 kg. 

Let’s say a confidence level of 95% is required for 
capturing the population mean p.Thus: 

x = [8, 10|(an interval) 
s = [3,4](an interval) 
n= 18, 
hence a small sample size, which requires a 
neutrosophic t critical value based on 18-—-1= 
17 Gf. 

From the below classical statisticsTable of t 
Critical Values,we find out that for 95% confidence 
level and 17 df, the corresponding 

tcritical value = 2.11. 
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t Table 

Ce a a os Po ae om 
oneal} 0.50 0.25 020 0.15 010 0.05 0.025 0.01 0.005 0.001 0.0005 
twortails} 1.00 050 0.40 030 020 010 0.05 0.02 0.01 0.002 0.001 


on oo 
> 
c— 
B 
c— 
3 
oO 
— 
mm 
a 
Ss 
rm 
o> 
on 
rare) 
w 
jad 
oo 
mS 
~_ 
= 
on 
= 
on 
— 
nm 
ed 
a 
=_ 
RS 
fred 
mS 
= 


1.684 3307 3.56 
084 = 1.045 «1.296 «1671 «2000 «2.390 2660 «3232 «= 3.460 
0846 1.043 «1.292 «1664 «1990 2374 «2639 3.15 3.416 
084 1.042 1290 1660 1984 2364 2626 3.474 = 3.390 
0842 «1.037 = 1.282 = 1646 = 1962 2.330 2881 3.088» 3.300 


0% 50% 60% 70% 80% 90% 95% 98% 99% 99.8% 99.9% 
Confidence Level 
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Apply the previous formula: 


+ (tcritical value) :—= = [8 io) 420124 
x wv (Utcritical value a : T 24. V18 
S 2.11(3) 2.11(4) 


= [8,10] + [1.492, 1.989]. 
Split the calculation into two possibilities: 
[8, 10] + [1.492, 1.989] = [8 + 1.492, 10 + 1.989] 
= [9.492, 11.989], 
and 
[8, 10] — [1.492, 1.989] = [8 — 1.989, 10 — 1.492] 
= [6.011, 8.508]. 

Now we combine both results in a conservative 
way, and we get the neutrosophic t confidence 
interval for the population average of weight lifting: 
(6.011, 11.989] kg. 


BeatriceGloria_personal library 
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